HIV targeted ribozymes

ABSTRACT

A synthetic RNA catalyst capable of cleaving an RNA substrate, the catalyst comprising a substrate binding portion and a &#34;hairpin&#34; portion. The invention also provides an engineered DNA molecule and a vector, each comprising a DNA sequence coding for an RNA catalyst according to the invention. The invention further comprises host cells transformed with the vectors of the invention which are capable of expressing the RNA catalyst. Finally, the invention provides a method of cleaving an RNA substrate which comprises contacting the substrate with a synthetic RNA catalyst according to the invention.

This invention was made in part with Government support under Grant No.DMB 8817576 awarded by the National Science Foundation and Grant No. RO1AI 29870 awarded by the National Institutes of Health. The Governmenthas certain rights in the invention.

This application is a divisional of U.S. Ser. No. 08/153,367, filed Nov.16, 1993 now U.S. Pat. No. 5,527,895, which was a continuation-in-partof U.S. Ser. No. 08/078,774, filed Jun. 17, 1993 which was acontinuation of U.S. Ser. No. 07/703,427, filed May 14, 1991, nowabandoned, which was a continuation-in-part of U.S. Serial No.07/577,658, filed Sep. 4, 1990, now abandoned, which was acontinuation-in-part of U.S. Ser. No. 07/409,666, filed Sep. 20, 1989,now abandoned, which was a continuation-in-part of U.S. Ser. No.07/247,100, filed Sep. 20, 1988, now abandoned. The contents of all ofwhich are hereby incorporated by reference into the subject application.

FIELD OF THE INVENTION

The present invention relates to an RNA catalyst which cleaves specificRNA sequences into a fragment having a 5' hydroxyl and a fragment havinga 2',3' cyclic phosphate. The products of the reaction described hereinresemble those resulting from the natural hydrolysis of RNA.

BACKGROUND OF THE INVENTION

Certain naturally occurring satellite, virusoid and viroid RNAs possessthe property of self-catalyzed cleavage. Self-cleavage has beendemonstrated in vitro for avocado sunblotch viroid (ASBV) (Hutchins, C.J., Rathjen, P. D., Forster, A. C. and Symons, R. H. (1986) NucleicAcids Res., 14: 3627-3640), satellite RNA from tobacco ringspot virus(sTRSV) (Prody, G. A., Bakos, J. T., Buzayan, J. M., Schneider, I. R.and Bruening, G. (1986) Science, 231: 1577-1580; Buzayan, J. M.,Gerlach, W. L. and Bruening, G. B. (1986) Proc. Natl. Acad. Sci. U.S.A.83: 8859-8862) and lucerne transient streak virus (VLTSV) (Forster, A.C. and Symons, R. H. (1987) Cell, 49: 211-220). These self-catalyzed RNAcleavage reactions share a requirement for divalent metal ions andneutral or higher pH and cleave target RNA sequences to give 5' hydroxyland 2',3'-cyclic phosphate termini (Prody, G. A., Bakos, J. T., Buzayan,J. M., Schneider, I. R. and Bruening, G. (1986) Science, 231: 1577-1580;Forster, A. C. and Symons, R. H. (1987) Cell, 49: 211-220; Epstein, L.M. and Gall, J. G. (1987) Cell, 48: 535-543; Buzayan, J. M. Gerlach, W.L., Bruening, G. B., Keese, P. and Gould, A. R. (1986) Virology, 151:186-199).

A "hammerhead" model has been proposed and accurately describes thecatalytic center of (+)sTRSV RNA, the (+) and (-) strands of ASBV andthe (+) and (-) strands of vLTSV (Forster, A. C. and Symons, R. H.(1987) Cell, 49: 211-220). The single exception is (-)sTRSV RNA whichdoes not fit the "hammerhead" model (Forster, A. C. and Symons, R. H.(1987) Cell, 49: 211-220; Buzayan, J. M., Gerlach, W. L. and Bruening,G. (1986) Nature, 323: 349-352; Buzayan, J. M., Hampel, A. and Bruening,G. B. (1986) Nucleic Acids Res., 14: 9729-9743), and the structure ofwhose catalytic center was unknown prior to the present invention. It istherefore understandable that the primary scientific focus has been onstudying the "hammerhead" consensus structure and, as regards sTRSV, onstudying the (+) strand.

Intermolecular cleavage of an RNA substrate by an RNA catalyst that fitsthe "hammerhead" model was first shown in 1987 (Uhlenbeck, O. C. (1987)Nature, 328: 596-600). The RNA catalyst was recovered and reacted withmultiple RNA molecules, demonstrating that it was truly catalytic.

Catalytic RNAs designed based on the "hammerhead" motif have been usedto cleave specific target sequences by making appropriate base changesin the catalytic RNA to maintain necessary base pairing with the targetsequences (Haseloff and Gerlach, Nature, 334, 585 (1988); Walbot andBruening, Nature, 334, 196 (1988); Uhlenbeck, O. C. (1987) Nature, 328:596-600; Koizumi, M., Iwai, S. and Ohtsuka, E. (1988) FEBS Lett., 228:228-230). This has allowed use of the catalytic RNA to cleave specifictarget sequences and indicates that catalytic RNAs designed according tothe "hammerhead" model may possibly cleave specific substrate RNAs invivo. (see Haseloff and Gerlach, Nature, 334, 585 (1988); Walbot andBruening, Nature, 334, 196 (1988); Uhlenbeck, O. C. (1987) Nature, 328:596-600).

However, catalytic RNAs such as those that were designed based on the"hammerhead" model have several limitations which restrict their use invitro and may forestall their use in vivo. For example, the temperatureoptimum for the reaction is 50°-55° C., which is well abovephysiological, and the kcat (turnover number) is only 0.5/min even at55° C. (Uhlenbeck, O. C. (1987) Nature, 328:596-600; Haseloff andGerlach, Nature, 334, 585 (1988)). In addition, the Km is 0.6 uM(Uhlenbeck, O. C. (1987) Nature, 32:596-600), meaning that the reactionrequires high concentrations of substrate which makes it difficult, ifnot impossible, for the catalytic RNA to cleave low levels of target RNAsubstrate such as would be encountered in vivo.

Cech et al. published application WO 88/04300 and U.S. Pat. No.4,987,071 also report the preparation and use of certain syntheticribozymes that have several activities, including endoribonucleaseactivity. The design of these ribozymes is based on the properties ofthe Tetrahymena ribosomal RNA self-splicing reaction. A temperatureoptimum of 50° C. is reported (page 39 of WO 88/04300; col. 20, lines4-5, of U.S. Pat. No. 4,987,071) for the endoribonuclease activity, andthe Km and kcat reported for this activity are 0.8 uM and 0.13/minute,respectively (Example VI, last paragraph).

In view of the above, there is a need for an RNA catalyst having a lowertemperature optimum, preferably near physiological temperatures, ahigher turnover number and a smaller Km and which can be engineered tocut specific target RNA substrates. Accordingly, based on the discoveryof a totally different structure disclosed hereinafter, it is an objectof the present invention to provide such an RNA catalyst. Other objectsand features of the invention will be in part apparent and in partpointed out. The invention, accordingly, comprises the products andmethods hereinafter described and their equivalents, the scope of theinvention being indicated in the appended claims.

SUMMARY OF THE INVENTION

The invention comprises a synthetic RNA catalyst capable of cleaving anRNA substrate which contains the sequence:

    5'-F.sub.1 -CS-F.sub.2 -3',

wherein,

CS is a cleavage sequence; and

F₁ and F₂ each is a sequence of bases flanking the cleavage sequence.

The catalyst comprises a substrate binding portion and a "hairpin"portion. The substrate binding portion of the catalyst has the sequence:

    3'-F.sub.4 -L.sub.1 -F.sub.3 -5'

wherein,

F₃ is a sequence of bases selected so that F₃ is substantially basepaired with F₂ when the catalyst is bound to the substrate;

F₄ is a sequence of bases selected so that F₄ is substantially basepaired with F₁ when the catalyst is bound to the substrate;

the sequences of F₃ and F₄ being selected so that each contains anadequate number of bases to achieve sufficient binding of the RNAsubstrate to the RNA catalyst so that cleavage of the substrate can takeplace; and

L₁ is a sequence of bases selected so that L₁ does not base pair with CSwhen the catalyst is bound to the substrate.

The "hairpin" portion is a portion of the catalyst that assumes ahairpin-like configuration when the substrate-catalyst complex ismodeled in two dimensions for minimum energy folding. The "hairpin"portion of the catalyst preferably has the sequence: ##STR1## wherein,P₁ and P₄ each is a sequence of bases, the sequences of P₁ and P₄ beingselected so that P₁ and P₄ are substantially base paired;

P₁ is covalently attached to F₄ ;

S₁ and S₂ each is a sequence of bases, the sequences of S₁ and S₂ beingselected so that S₁ and S₂ are substantially unpaired;

P₂ and P₃ each is a sequence of bases, the sequences of P₂ and P₃ beingselected so that P₂ and P₃ are substantially base paired; and

L₂ is a sequence of unpaired bases.

RNA catalysts according to the invention can cleave substrates of anylength or type as long as they contain an appropriate cleavage sequence.In particular, the catalysts can be used to cleave a specific sequencein naturally-occurring RNA having a cleavage sequence, as well as RNAswhich have been engineered to contain a cleavage sequence.

The invention further comprises an engineered DNA molecule and a vector,each of which comprises a DNA sequence that codes for an RNA catalystaccording to the invention. The invention also comprises a hosttransformed with the vector, the host being capable of expressing theRNA catalyst. In particular, hosts can be transformed with vectors that,when transcribed, will produce RNA catalysts which can cleave any RNA,native or foreign, found in the host. For example, hosts can betransformed with vectors that, when transcribed, produce RNA catalystswhich can regulate the expression of genes by cleaving messenger RNA orwhich act as anti-viral agents by cleaving viral RNA. Thus, theinvention has application in vitro and in vivo in prokaryotes andeukaryotes of plant or animal origin in regulating gene expression andfor controlling viral infections.

Finally, the invention includes a method of cleaving an RNA substratecomprising contacting the substrate with an RNA catalyst according tothe invention. The reaction is unique because it occurs underphysiological conditions, having a temperature optimum near 37° C., withvery favorable reaction parameters. The method can be practiced in vitroor in vivo. For instance, the method may be practiced in vivo in hostcells that have been transformed with a vector that codes for an RNAcatalyst according to the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the (-)sTRSV RNA substrate-catalyst complex that fits the"hairpin" model of catalytic RNA in accordance with the presentinvention (SEQ ID NOS 57-58).

FIG. 2 shows minimum energy folding of (-)sTRSV RNA (SEQ ID NO 59).

FIGS. 3A-3B shows the time course of catalysis of a substrate RNA by thecatalytic RNA.

FIGS. 4A-4B shows the Michaelis-Menten kinetics of the RNA catalyticreaction.

FIGS. 5A-5B shows the temperature dependence of the RNA catalyticreaction.

FIGS. 6A-6B shows the dependence of the rate of reaction onconcentration of catalytic RNA.

FIG. 7 shows the reaction properties of a smaller RNA substrate.

FIGS. 8A-8B shows the loss of catalytic activity when the terminal A atposition 175 or the terminal bases AU at positions 175 and 176 areremoved from the catalytic RNA.

FIG. 9 illustrates a mutated "hairpin" ribozyme with based 195-203 inthe catalytic RNA sequence removed (these bases are indicated by beingunderlined).

FIG. 10 illustrates a mutated "hairpin" ribozyme having bases AAA atpositions 203, 202 and 201 (shown in a box) changed to CGU respectively(written below AAA in the figure).

FIGS. 11A-11F show that there is no effect on catalytic activity whenbase A at position 49 in the substrate is changed to a G, U or C (SEQ IDNOS 60-62).

FIGS. 12A-12B shows that different target RNA sequences can be used aslong as the base pairing with the catalytic RNA in the regions flankingthe cleavage sequence is maintained (SEQ ID NOS 63,64).

FIGS. 13A-13B shows that an RNA sequence found in tobacco mosaic viruscan be cleaved at a specific site with the catalytic RNA of the presentinvention (SEQ ID NOS 65,66).

FIGS. 14A-14F show three substrates having sequences found in thesequence of the messenger RNA coding for chloramphenicol acetyltransferase. FIGS. 14A-14F also show the separation patterns onacrylamide gels of the reaction products obtained by reacting thesesubstrates with catalytic RNAs designed to base pair with the substratesin the regions flanking the AGUC cleavage sequence (SEQ ID NOS 67-69).

FIGS. 15A-15C shows the sequence of a substrate having a sequence foundin the sequence coding for the gag protein of the HIV-1 virus whichcauses AIDS. FIGS. 15A-15C also shows the separation patterns onacrylamide gels of the reaction products obtained by reacting thissubstrate with a catalytic RNA designed to base pair with the substratein the regions flanking the CGUC cleavage sequence of the substrate (SEQID NOS 67-69).

FIGS. 16A-16B shows the sequence of a substrate having a sequence foundin the sequence coding for the regulatory tat protein of the HIV-1virus. FIGS. 16A-16B also shows the separation patterns on an acrylamidegel of the reaction products obtained by reacting this substrate with acatalytic RNA designed to base pair with the substrate in the regionsflanking the UGUC cleavage sequence of the substrate (SEQ ID NO 70).

FIGS. 17A-17B shows the sequence of a substrate having four non-nativeU's added to the 3' end of the sequence of the native (-)sTRSV substrateshown in FIG. 1. FIGS. 17A-17B also shows the separation patterns on anacrylamide gel of the reaction products obtained by reacting thissubstrate with different concentrations of a catalytic RNA designed tobase pair with the substrate in the regions flanking the cleavagesequence of the substrate, including with the four non-native U's (SEQID NO 72).

FIG. 18 summarizes the sequence requirements for the target region ofthe substrate RNA. Only GUC is required for cleavage (SEQ ID NOS 73,74).

FIGS. 19A-19B shows the positions of base changes (open boxes) made inthe sequence of the catalytic RNA shown in FIG. 1 in order to prove theexistence of Helices 3 and 4 predicted by the "hairpin" model for(-)sTRSV RNA. FIGS. 19A-19B also shows the separation patterns onacrylamide gels of the reaction products obtained by reacting thevarious catalytic RNAs with substrate RNA S17.

FIGS. 20A-20B shows the RNA sequence of an auto-catalytic cassette thathas utility in terminating transcription at a very specific site. FIGS.20A-20B also shows the separation pattern on an acrylamide gel of thereaction products obtained when this catalyst was transcribed andcleaved autocatalytically (SEQ ID NO 75).

FIG. 21A-21B shows the positions of two base changes that were made inthe native (-)sTRSV catalytic RNA sequence shown in FIG. 1. FIGS.21A-21B also shows the separation patterns on acrylamide gels of thereaction products obtained by reacting these catalysts or catalyst R51(control gel) with substrate S17.

FIG. 22A shows a plasmid map illustrating pertinent features of vectorPHC-CAT which contains the CAT gene linked to the "hairpin"autocatalytic cassette of the invention. Also shown is the expected RNAtranscript of the illustrated region of pHC-CAT and the expected 5'fragment of the autocatalytic cleavage.

FIGS. 22B-22C shows the results of Northern blot analysis of RNAproduced by host cells transformed with pHC-CAT.

FIG. 23: Map of the pMHC-CAT mammalian expression vector (SEQ ID NO 76).

FIG. 24: Map of plasmid pHC19R (SEQ ID NO 77).

FIG. 25: Shows the results of S1 nuclease protection assay of RNA fromtransfected CHO cells and RNA controls.

FIG. 26: Map of viral vector constructions (CMV) with the hairpinautocatalytic cassette in the sense orientation (pCS101HC7) and theantisense orientation (pCS101HC9).

FIG. 27A: Shows the results of S1 nuclease protection assay of RNA frominfected turnip plants and RNA controls.

FIG. 27B: Results of Northern blot analysis of the RNA isolated fromcontrol and infected plants.

FIG. 27C: Results of PCR amplification of viral sequences from controland infected turnip plants.

FIGS. 27D and 27E: Results of Western immunoblot analysis of proteinextracts of turnips leaves from control and infected plants.

FIG. 28: HIV-1 target sequence.

FIG. 29: Conservation of the HIV-1 target sequence in various HIVisolates (SEQ ID NOS 78-81).

FIG. 30: Sequences of HIV-1 substrate ("SHIV") containing the conservedtarget sequence and of an engineered "hairpin" catalytic RNA ("RHIV")designed to cleave the substrate (SEQ ID NOS 82,83).

FIGS. 31A-31B and 32A-32B: Results of the cleavage of SHIV substrate RNAby RHIV catalytic RNA.

FIGS. 33A-33B: Sequence of longer HIV-1 transcript and results of itscleavage by RHIV.

FIGS. 34A-34B: Map of pHR and partial sequence (SEQ ID NO 84).

FIG. 35: Results of cleavage of substrate SHIV by RHIV and by catalyticRNA produced by T7 RNA polymerase transcription of pHR.

FIG. 36: Map of pMSGRHIV and partial sequence (SEQ ID NO 85).

FIG. 37: Map of plasmid pMRHPT and partial sequence (SEQ ID NO 86).

FIG. 38: Selection and testing scheme for RHGPT.

FIG. 39: Results of S1 nuclease assay for reduction of HGPRT mRNA in CHOcells transfected with pMRHPT and pMSG-dhfr.

FIG. 40: Map of the plasmid pMCATRCAT and partial sequence (SEQ ID NO87).

FIG. 41: Results of S1 nuclease assay for reduction of CAT mRNA in CHOcells transfected with pMCATRCAT and pMSG-dhfr.

FIGS. 42A-42C: Summary of mutagenesis experiments with the (-)sTRSV RNAsubstrate-catalyst complex (SEQ ID NOS 88,89).

FIG. 42D: A more refined secondary structure model for the (-)sTRSV RNAsubstrate-catalyst complex.

FIG. 43: Separation patterns on an acrylamide gel of the reactionproducts obtained by reacting substrate with "hairpin" catalytic RNAhaving the loop that closes the "hairpin" replaced by the sequenceGGAC(UUCG)GUCC (SEQ ID NO 1).

FIG. 44: Sequences of the HIV-1-specific active and disabled "hairpin"ribozymes.

FIG. 45: Sketch of acrylamide gel showing results of assay for thebinding to substrate of the HIV-1-specific active and disabled "hairpin"ribozymes.

FIG. 46: Sketch of GeneScreen Plus membrane showing results of dot blotassay for ribozyme RNA in total RNA extracted from HeLa cells.

FIG. 47A: Graph showing the effects of active and disabledHIV-1-specific ribozymes on HIV-1 expression in human HeLa cells. Theresults are presented as percent activation, a measure ofchloramphenicol acetyl transferase (CAT) activity relative to control(no ribozyme).

FIG. 47B: Graph showing the effects of active and disabledHIV-1-specific ribozymes on HIV-1 expression in human HeLa cells. Theresults are presented as percent activation, a measure of the amount ofp24 HIV-1 antigen relative to control (no ribozyme).

FIG. 48: Graph showing the specificity of the active HIV-1-specific"hairpin" ribozyme. The results are expressed as percent activity, ameasure of CAT activity relative to control (no ribozyme).

FIG. 49: Map of plasmid pdRHIV.

FIG. 50: Map of plasmid pCDLTR.

FIG. 51: Schematic representation of events occurring in HeLaT4+ cellstransfected with plasmids pdRHIV, pCDLTR and pTAT.

DETAILED DESCRIPTION OF THE DRAWINGS

FIG. 1. The "hairpin" model for RNA catalysis was developed bydetermining the minimum energy folding predicted by computer modeling ofthe catalytic complex containing the minimum size catalytic RNA andsubstrate RNA of (-)sTRSV RNA. It is this minimum energy folding whichis shown in FIG. 1. Two molecules are shown folded: (1) catalytic RNAwhich contains 50 bases of satellite RNA (224-175) and (2) substrate RNAwhich contains 14 bases of satellite RNA (53-40). The arrow representsthe site of cleavage.

The 50-base catalytic RNA and the 14-base substrate RNA are the "minimumsize" in the sense that reductions in their length result in asubstantial or total loss of catalytic activity as is shown in theExamples below. Thus, this length of (-)sTRSV catalyst sequence ispreferred to shorter lengths. Also, substrate RNA having at least thedegree of base-pairing with the catalyst exhibited by the 14-basesubstrate is preferred.

FIG. 2. Minimum energy folding of (-)sTRSV RNA. The molecule was foldedusing the Wisconsin RNA folding program (Zucker, M. and Stiegler, P.(1981) Nucleic Acids Res., 9: 133-148; Devereux, J., Haeberli, P. andSmithies, O. (1984) Nucleic Acids Res., 12: 387-395) with base numberscorresponding to (+)sTRSV (Buzayan, J. M., Gerlach, W. L., Bruening, G.B., Keese, P. and Gould, A. R. (1986) Virology, 151: 186-199). With thisnumbering scheme the 5'-3' direction of the molecule is with decreasingbase number. The minimum catalytic complex is identified. The substrateRNA sequence is between bases 53-40 and the catalytic RNA sequence isbetween bases 224-175. The arrow is the site of cleavage.

The folding identifies regions of expected base pairing and expectednon-base pairing, loops. This model does not preclude higher orderinteractions occurring between the loops.

FIGS. 3A-3B. Time course of substrate S17 cleavage by catalytic RNA R51.The reaction was carried out under standard conditions, which were 37°C. in 12 mM MgCl₂, 40 mM Tris pH 7.5 and 2 mM spermidine, for thefollowing times: lane 1, 30 sec; lane 2, 5 min; lane 3, 15 min; lane 4,30 min; lane 5, 60 min; lane 6, 90 min; lane 7, 150 min. Concentrationswere as follows: R51!=0.0032 uM and S17!=0.09 uM. RNA was separated on7M urea, 20% acrylamide gels, bands cut out and counted in the liquidscintillation counter for FIGS. 3A-17B and 19A-21B. Throughout thefigures, the designations 5'F and 3'F are the products of cleavage ofthe substrate and represent the resulting 5' fragments and 3' fragments,respectively.

FIGS. 4A-4B. Eadie Hofstee plot of catalytic RNA R51 cleavage ofsubstrate RNA S17. The reaction was carried out at 37° C. in 12 mMMgCl₂, 40 mM Tris pH 7.5 and 2 mM spermidine. Concentrations were asfollows: R51!=0.0004 uM and S17!=0.125 uM (lane 1), 0.0625 uM (lane 2),0.0417 uM (lane 3), 0.031 uM (lane 4), 0.021 uM (lane 5), 0.0156 uM(lane 6), 0.0078 uM (lane 7) and 0.0039 uM (lane 8).

FIGS. 5A-5B. Temperature dependence of the rate of cleavage of substrateRNA S17 by catalytic RNA R51. The reaction was carried out in 12 mMMgCl₂, 40 mM Tris pH 7.5 and 2 mM spermidine at 45° C. (lane 6), 41° C.(lane 5), 37° C. (lane 4), 33° C. (lane 3), 27° C. (lane 2) and 20° C(lane 1). The concentrations used were: R51!=0.003 uM and S17!=0.09 uM.R51 was unlabeled. The velocities shown in the graph in FIGS. 5A-5B werecalculated by the use of time points of 8 and 16 minutes. The separationpatterns of the reaction products on the acrylamide gel shown in thefigure are for the 16-minute time point.

FIGS. 6A-6B. Rate of cleavage of substrate RNA S17 at varyingconcentrations of catalytic RNA R51. The reaction was carried out at 37°C. in 12 mM MgCl₂, 40 mM Tris pH 7.5 and 2 mM spermidine for 40 min(lane 1 and 2), 20 min (lane 3), 10 min (lane 4) and 5 min (lane 5). Theconcentration of substrate used was 0.175 uM.

FIG. 7. Eadie Hofstee plot of catalytic RNA R51 cleavage of substrateRNA S10. The substrate S10 containing the RNA sequence: GACAGUCCUG wasprepared from a DNA template containing the T-7 promoter as described inExample 2. This substrate was mixed with the catalytic RNA, R51, fromExample 2 under standard conditions: 37° C. in 12 mM MgCl₂, 40 mM TrispH 7.5 and 2 mM spermidine for 10 min. Concentrations of substrate usedwere as follows: 0.115 uM, 0.77 uM, 0.038 uM, 0.029 uM, 0.014 uM. Theconcentration of catalytic RNA, R51, used was 1 nM. The line was fit bylinear regression analysis and intercept, kcat, and Michaelis constant,Km, calculated.

FIGS. 8A-8B. Removal of the terminal "A" at position 175 of thecatalytic RNA. The "A" at base position 175 (circled) was removed andthe resulting catalytic RNA, R50, reacted with substrate S17. Inaddition, two bases were removed, to give R49 which had both A175 andU176 removed. Concentration of substrate S17 was 0.3 uM and allcatalytic RNA concentrations were 4 nM. The reaction times were 20 minunder standard conditions. Lane 1 R51/S17; Lane 2 R50/S17; Lane 3R49/S17; Lane 4 S17 alone. A 75% lose of activity was seen with R50 andR49.

FIG. 9 illustrates a mutated "hairpin" ribozyme with bases 195-203 inthe catalytic RNA sequence removed (these bases are indicated by beingunderlined). When the underlined bases were removed and the adjacentbases ligated together, no catalytic activity was seen.

FIG. 10 illustrates a mutated "hairpin" ribozyme having bases AAA atpositions 203, 202 and 201 (shown in a box) changed to CGU respectively(written below AAA in the figure). When the boxed AAA bases werereplaced by 5'-CGU-3' bases, no catalytic activity was seen.

FIGS. 11A-11B. No effect on activity when base A49 in the substrate ischanged to a "G". The circled "A" base 49 in the substrate was changedto a "G" and no effect on activity was seen. The concentration of R51was 0.016 uM, S17!=0.4 uM, and S17(A→G)!=0.2 uM. Reaction under standardconditions was for 40 min. Lane 1 S17; Lane 2 S17/R51; Lane 3 S17(A→G);Lane 4 S17 (A→G)/R51.

FIGS. 11C-11D. No effect on activity was seen when base A49 in thesubstrate was changed to a "U" (S17(A→U)). The concentration ofsubstrate RNA S17 (A→U) used was 0.12 uM and the concentration of R51was 0.0065 uM. Reaction was for 60 minutes under standard conditions.The catalytic RNA was unlabeled.

FIGS. 11E-11F. No effect on activity was seen when base A49 in thesubstrate was changed to a "C" (S17(A→C)). The concentration ofsubstrate RNA S17 (A→C) used was 0.08 uM and the concentration of R51was 0.0065 uM. Reaction was for 60 minutes under standard conditions.The catalytic RNA was unlabeled.

FIGS. 12A-12B. Different target RNA sequences can be used as long as thebase pairing is maintained with the catalytic RNA. The C:G base pairpredicted by the "hairpin" model of the catalytic complex of (-)sTRSV,FIG. 1, was changed to a G:C base pair (circled) and activity wasmaintained. In this experiment the usual substrate S17 was not used;instead a new substrate was used with the exact same sequence except thefirst two vector bases GC were eliminated. The resulting sequence ofthis new substrate S15 was gUGACAGUCCUGUUU(SEQ ID NO 3). The substratecontaining the C50→G50 base change was S15(C→G) and the catalytic RNAwith the G214→C214 base change was R51(G→C). The reactions were rununder standard conditions for 60 min at R51!=0.07 uM, S15!=0.12 uM,S15(C→G)!=0.15 uM, R51(G→C)!=0.05 uM. Lane 1 R51/S15; Lane 2R51(G→C)/S15; Lane 3 S15; Lane 4 R51(G→C)/S15(C→G); Lane 5 S15(C→G).

FIGS. 13A-13B. An RNA sequence found in tobacco mosaic virus (TMV) canbe cleaved at a specific site. The substrate RNA shown is that beginningwith nucleotide #538 in the tobacco mosaic virus sequence. The catalyticRNA was synthesized to base pair with the TMV substrate RNA in the stemregions of the "hairpin" as shown by the circled base pairs. Reactionwas for 20 min under standard conditions with a catalytic RNAconcentration of 3 nM and a substrate concentration of 0.06 uM. Lane 1TMV substrate RNA only; Lane 2 TMV catalytic RNA/TMV substrate RNA.

FIGS. 14A-14F. The sequences of three substrate RNAs having sequencesfound in the messenger RNA for chloramphenicol acetyl transferase (CAT)are shown. They have 14-, 16- and 18-base long target sites, and thelength of the 3' regions flanking the AGUC cleavage sequence is extendedin substrates (B) and (C) as compared to substrate (A). Catalytic RNAsdesigned to base pair with the substrate RNAs in both the 3' and 5'regions flanking the cleavage sequence AGUC were synthesized. The openboxed bases are those which are different from those in the native(-)sTRSV substrate RNA sequence shown in FIG. 1.

FIGS. 15A-15C. This figure shows the sequence of a substrate RNA havinga sequence found in the region of the HIV-1 viral RNA which specifiesthe gag protein. A catalytic RNA was made whose sequence was designed sothat the catalyst would base pair with the substrate RNA in both the 3'and 5' regions flanking the cleavage sequence. The open boxed bases arethose which are different than those of the native (-)sTRSV sequenceshown in FIG. 1. The catalytic RNA cleaved the substrate RNA at thearrow.

FIGS. 16A-16B. Shown is the sequence of a substrate RNA having thesequence found at the beginning of the coding region for the regulatoryprotein tat of the HIV-1 virus. A catalytic RNA was made which wasdesigned so that it would base pair with the substrate RNA in both the3' and 5' regions flanking the UGUC catalytic cleavage sequence. Theopen boxes are bases which are different from those of the native(-)sTRSV substrate sequence shown in FIG. 1. Cleavage occurred at thearrow as shown.

FIGS. 17A-17B. Shown is the sequence of a substrate RNA having fournon-native bases (UUUU) added to the 3' end of the sequence of thenative (-)sTRSV substrate RNA shown in FIG. 1. A corresponding catalyticRNA was made whose sequence was designed so it would base pair with thesubstrate in both the 3' and 5' regions flanking the AGUC cleavagesequence. Cleavage rates with a constant catalytic RNA concentration andvarious concentrations of substrate RNA were determined by cutting outthe bands of the acrylamide gels, counting the radioactivity andplotting the data using Michaelis-Menton procedures to calculate Km andkcat.

FIG. 18. Summary of the sequence requirements for the target region ofsubstrate RNA. Only a GUC sequence is required for cleavage of thesubstrate as long as the short sequences of bases in the regions of thesubstrate flanking the cleavage sequence are substantially base pairedwith corresponding regions of the RNA catalyst. The regions of basepairing are labeled Helix 1 and Helix 2 in the figure. Also, the regionsof base pairing predicted by the "hairpin" model for (-)sTRSV to existin the "hairpin" portion of the catalyst are labeled Helices 3 and 4.

FIGS. 19A-19B. Confirmation of the existence of Helices 3 and 4predicted by the "hairpin" model for (-)sTRSV RNA. A G→C base mutationin base 35 (count bases from the 5' end of the catalytic RNA sequenceshown) of the (-)sTRSV catalytic RNA sequence shown in FIG. 1 resultedin an RNA with no catalytic activity (Lanes 3 and 4 ("mismatch")). Adouble mutant, G35→C; C27→G had restored catalytic activity (Lanes 5 and6 ("substitute b.p.")). These two base changes are in the Helix 4 regionwhose existence is predicted by the "hairpin" model for (-)sTRSV. Also,a catalytic RNA having a single base change at position 47 (G47→C) wasinactive (Lanes 9 and 10), while a double mutant, with a second mutationC17→G, had restored activity (Lanes 11 and 12). These two base changesare in the Helix 3 region whose existence is predicted by the "hairpin"model. The control (Lanes 1, 2, 7 and 8) is cleavage of the substrateRNA S17 having the native (-)sTRSV sequence by catalytic RNA sequenceR51 having the native sequence.

FIGS. 20A-20B. The RNA sequence of a synthetic "hairpin" autocatalyticcassette is shown. The sequence shown in FIGS. 20A-20B is the same asthat of the catalyst shown in FIG. 1, but with additional 5' bases addedto form a loop at the 5' end of the catalyst and to provide a substratetarget sequence (i.e., a cleavage sequence and upstream and downstreamflanking bases) which can bind to the substrate binding portion of thecatalyst sequence. Such an RNA was prepared. When transcription wasperformed, the cassette autocatalytically cleaved at the expected siteto give the appropriate 3' F and 5' F products.

FIGS. 21A-21B. Shown are two base changes that were made in the native(-)sTRSV catalytic sequence shown in FIG. 1. The two bases changes werean "A" to "U" mutation at position 217 and a "G" to "C" mutation atposition 216. FIGS. 21A-21B also shows the separation patterns onacrylamide gels of the reaction products obtained by reacting one ofthese catalysts or R51 (control) with substrate S17. Both base changesproduced catalysts that were inactive when the catalysts were reactedwith substrate S17 under standard conditions for 15 minutes.

FIG. 22A shows a plasmid map illustrating pertinent portions of vectorpHC-CAT containing the CAT gene linked to the "hairpin" autocatalyticcassette of the invention so that the "hairpin" autocatalytic RNA wouldbe expected to serve as a chain terminator for the CAT gene. Also shownis the expected RNA transcript of the illustrated region ("CAT-cassetteRNA") and the expected 5' fragment of the autocatalytic cleavage("Cleaved CAT-cassette RNA"). Finally, the figure illustrates thelocation of DNA probes designed to hybridize with different regions ofthe CAT-cassette RNA transcript and Cleaved CAT-cassette 5' fragment.

FIGS. 22B-22C show the results of Northern blot analysis of RNA isolatedfrom Escherichia coli host cells transformed with pHC-CAT. When the CATprobe was used, both the full length transcript and the expected 5'cleavage fragment were detected, indicating that cleavage took place invivo. When the "hairpin" autocatalytic cassette probe was used, only thefull length CAT-cassette RNA transcript was detected. The fact that the5' fragment did not hybridize with this probe was to be expected, sincemost of the "hairpin" autocatalytic cassette transcript would be in the3' fragment after cleavage. Although, it would be expected that the"hairpin" autocatalytic cassette probe would hybridize to the 3'fragment, the fact that the 3' fragment was not detected by Northernblot analysis is not surprising. The 5' terminus of the 3' fragmentwould contain a 5'--OH and not the 5'-ppp which is ordinarily seen inRNA transcripts. Thus, the 3' fragment would be expected to be verylabile in vivo and was likely degraded immediately after theautocatalytic cleavage.

FIG. 23. Shown is the map of the vector pMHC-CAT which was constructedby excising the "hairpin" autocatalytic cassette from pHC (prepared asdescribed in Example 23) with SmaI/SalI and ligating it to vector pMSGat the SmaI/XhoI sites to give the vector PMHC. Then, the CAT gene wasexcised from pMAM-NEO-CAT with SmaI/XhoI and was ligated into pMHC atthe SmaI/XhoI sites to give pMHC-CAT as shown.

FIG. 24. Shown is the map of the vector pHC19R which was prepared byexcising the "hairpin" autocatalytic cassette from pHC with BamHI/SalIand ligating it to the SalI/BamHI sites of pTZ19R to give pHC19R asshown.

FIG. 25. Shown are the results of an S1 nuclease protection assayperformed on RNA isolated from CHO cells that had been transfected withpMHC-CAT. In the gel shown in FIG. 25, the lanes contain: Lane 1 - - -probe (148 nt); Lane 2 - - -S1 nuclease digested probe; Lane 3 - - - invitro transcribed "hairpin" autocatalytic cassette that had beenhybridized to probe and S1 nuclease digested (uncleaved 134 nt, 3' F 87nt, 5' F 47 nt); Lane 4- - - RNA isolated from uninduced,pMHC-CAT-transfected CHO cells, hybridized to probe and S1 nucleasedigested; and Lane 5 - - - RNA isolated from dexamethasone induced,pMHC-CAT-transfected CHO cells, hybridized to probe and S1 nucleasedigested (uncleaved 111 nt, 3' F 69 nt, 5' F 42 nt). All mobilities wereas expected.

FIG. 26. Shown are catalytic RNA/viral vector constructions used toinfect plants. In these constructions, the "hairpin" autocatalyticcassette from the vector pHC was ligated to the cauliflower mosaic virus(CMV) in vector pCS101 to give the two engineered CMV viral constructsshown. The "hairpin" autocatalytic cassette is in the sense orientationin pCS101HC7 and in the antisense orientation in pCS101HC9.

FIG. 27A: Shown are the results of an S1 nuclease protection assayperformed on RNA isolated from turnip plants that had been infected withpCS101HC7 or pCS101HC9. Lane 1 is undigested "hairpin" autocatalytic RNAprobe (148 nt) and Lane 2 is RNA transcribed from HindIII-digestedplasmid pHC which gave three RNA products--uncleaved "hairpin"autocatalytic cassette (158 nt), 3' F (87 nt) and 5' F (71 nt). Theremaining lanes are all S1 nuclease digests of probe hybridized to thefollowing RNA preparations: Lane 3 - - RNA from uninfected plants; Lane4 - - RNA from plants infected by the virus control (pCS101);

Lane 5 - - RNA from plants infected with pCS101HC7 (uncleaved 112 nt, 3'F 70 nt and 5' F 42 nt); Lane 6 - - RNA from plants infected withpCS101HC9.

FIG. 27B: Results of Northern blot analysis of the RNA isolated fromplants that were mock infected, infected with pCS101 (wild-type CMV) orinfected with pCS101HC7. The probe was labelled "hairpin" autocatalyticRNA. Lane 1, RNA from mock-infected plants; Lane 2, RNA from plantsinfected with pCS101 (wild-type CMV); and Lane 3, RNA from plantsinfected with pCS101HC7.

FIG. 27C: Total DNA from plants that were mock infected, infected withpCS101 (wild-type CMV) or infected with pCS101HC7 was amplified by thepolymerase chain reaction (PCR). Lane M, molecular weight markers; Lane1, PCR-amplified DNA from mock-infected plants; Lane 2, PCR-amplifiedDNA from plants infected with pCS101 (wild-type CMV); Lane 3,PCR-amplified DNA from plants infected with pCS101HC7; Lane 4,PCR-amplified DNA from the pCS101 plasmid; and Lane 5 PCR-amplified DNAfrom pCS101HC7 plasmid.

FIGS. 27D and 27E: Results of Western immunoblot analysis of proteinextracts of turnips leaves from plants that were mock infected, infectedwith pCS101 (wild-type CMV) or infected with pCS101HC7. FIG. 27D showsthe results for plants one month after inoculation, and FIG. 27E showsthe results for plants two months after inoculation. The lanes are thesame in both figures. Lane 1, protein from mock-infected plants; Lane 2,protein from plants infected with pCS101 (wild-type CMV); and Lane 3,protein from plants infected with pCS101HC7. CP=coat protein,(M)=molecular weight markers.

FIG. 28: HIV-1 target sequence. The 16-base target sequence is found inthe 5'-leader region of all 9 HIV-1 mRNAs. The nucleotide numberingstarts at the first base transcribed in the HXB2 clone (HIV SequenceData Base, prepared and distributed by Gerald Myers et al., Los AlamosNational Laboratory, Los Alamos, N. Mex., telephone (505) 665-0480).

FIG. 29: Conservation of the HIV-1 target sequence in various HIVisolates. All sequences are from the HIV Sequence Data Base and arelisted 5'→3'. All homologies are +107 to capsite (+1), except MAL whichis +105. The capsite sequence searched for was GGT CTC TCT. Only twoisolates (MAL and MN) had variations in the target sequence, and thevariations were, in each case, a single base change (G→A). In thefigure, cg indicates that the complete genome was contained in thesequence file, and "0 - - - " indicates proviral DNA without anyhomology which contained sequence information that started too late orended too soon. It is likely that a homologous sequence does indeedoccur in these strains.

FIG. 30: Sequence of HIV-1 substrate ("SHIV") having the target sequenceof FIG. 28 plus additional GCG vector bases at its 5' end. Also shown isthe sequence of engineered "hairpin" catalytic RNA ("RHIV") designed tocleave this substrate. The catalytic RNA also has additional 5' vectorbases 3'-CUGAGGG-5' as shown.

FIGS. 31A-31B: Time course of cleavage of the substrate RNA SHIV by RHIV(both depicted in FIG. 30). Shown are the catalytic RNA RHIV (R),substrate RNA SHIV (S), 3' cleavage fragment (3' F), and 5' cleavagefragment (5' F). Since 35% of the substrate was uncleavable, theremaining 65% was normalized to 100% on the ordinate of the graph.

FIGS. 32A-32B: Kinetics of the cleavage of the substrate RNA SHIV byRHIV (both depicted in FIG. 30) Shown are the catalytic RNA RHIV (R),substrate RNA SHIV (S), 3' cleavage fragment (3' F), and 5' cleavagefragment (5' F). The incubation time was 5 minutes, and theconcentration of RHIV was 0.005 uM. The concentration of SHIV was: Lane1--0.10 uM; Lane 2--0.05 uM; Lane 3--0.025 uM; Lane 4--0.012 uM; Lane5--0.006 uM; and Lane 6--0.025 uM (this is control lane at zero time).From the graph, the Km was found to be 100 nM, and the kcat to be1.6/min. From the time course shown in FIGS. 31A-31B, it was determinedthat 35% of the substrate was uncleavable. This was subtracted fromthese calculations.

FIGS. 33A-33B: Sequence of longer HIV-1 transcript and results of itscleavage by RHIV. Shown on the gel are the uncleaved transcript of 183ntand the two cleavage products (5' F of 111 nt and 3' F of 72 nt). Thecontrol lane at 0 minutes showed no cleavage. The gel was calibratedwith standards, and all mobilities were as expected.

FIGS. 34A-34B: Map of pHR and partial sequence.

FIG. 35: Results of cleavage of the substrate SHIV by the 101ntcatalytic RNA (designated as "PRHIV") produced by T7 RNA polymerasetranscription of pHR. Shown on the gel are RHIV (R), PRHIV (PR), SHIV(S), 3' cleavage fragment (3' F), and 5' cleavage fragment (5' F). Timesof incubation were 0 and 15 minutes.

FIG. 36: Map of pMSGRHIV and partial sequence.

FIG. 37: Map of plasmid pMRHPT. This mammalian expression vectorcontains DNA encoding an engineered "hairpin" catalytic RNA ("RHGPT")under control of the dexamethasone-inducible MMTV promoter andterminated by the "hairpin" autocatalytic cassette.

FIG. 38: Selection and testing scheme for RHGPT.

FIG. 39: Results of S1 nuclease assay for reduction of HGPRT mRNA in CHOcells transfected with pMRHPT and pMSG-dhfr. Lane 1 contains the P³²-labelled 148 nt probe which hybridizes to HGPRT mRNA; Lane 2 containsP³² -labelled "hairpin" autocatalytic RNA used as a standard (seen onlyon longer exposures); Lanes 3 and 4 contain S1-nuclease digested RNAisolated from cells transformed with pMRHPT and pMSG-dhfr and induced(Lane 4) or not induced (Lane 3) with dexamethasone. The arrow shows thelocation of the probe-protected RNA which corresponds to HGPRT mRNA.

FIG. 40: Map of the plasmid pMCATRCAT. This plasmid contains DNAencoding a "hairpin" RNA catalyst engineered to cleave CAT mRNAoperatively linked to the "hairpin" autocatalytic cassette, all drivenby the dexamethasone-inducible MMTV promoter. The CAT gene is on thesame plasmid and is driven by the SV40 promoter.

FIG. 41: Results of S1 nuclease assay for reduction of CAT mRNA in CHOcells transfected with pMCATRCAT and pMSG-dhfr. Lane 1 contains the P³²-labelled 119 nt probe which hybridizes to CAT MRNA; Lane 2 contains RNAfrom untransfected CHO cells; Lane 3 contains mRNA transcribed from theCAT gene on vector pHC-CAT in vitro; Lanes 4 and 5 contain S1-nucleasedigested RNA isolated from cells transfected with pMCATRCAT andpMSG-dhfr and induced (Lane 5) or not induced (Lane 4) withdexamethasone.

FIGS. 42A-C: Summary of mutagenesis experiments with the (-)sTRSV"hairpin" substrate-catalyst complex. Each base or combination of basesenclosed in a circle represents a separate mutational experiment.

FIG. 42D: A more refined secondary structure model for the (-)sTRSV"hairpin" substrate-catalyst complex.

FIG. 43: Separation patterns on an acrylamide gel of the reactionproducts obtained by reacting substrate with a "hairpin" catalytic RNAhaving the loop that closes the "hairpin" replaced by the hairpinsequence GGAC(UUCG)GUCC.

FIG. 44: Sequences of the HIV-1-specific active and disabled "hairpin"ribozymes.

FIG. 45: Sketch of acrylamide gel showing results of assay for thebinding to substrate of the HIV-1-specific active and disabled "hairpin"ribozymes. Lane 1 contains ³² P-labelled substrate alone with no addedribozyme. Lane 2 contains a ³² P-CTP-labelled active ribozyme used as amobility reference. Lane 3 contains unlabelled HIV-1-specific activeribozyme and a ³² P-CTP-labelled substrate (9% bound). Lane 4 containsunlabelled disabled ribozyme and a ³² P-CTP-labelled substrate (18%bound). Arrows indicate the mobility of (S) substrate, (R) ribozyme and(RS) the ribozyme/substrate complex which forms when active or disabledribozyme binds to substrate.

FIG. 46: Sketch of GeneScreen Plus membrane showing results of dot blotassay for ribozyme RNA in total RNA extracted from HeLa cells. Lane 1:RNA prepared from untransfected cells. Lane 2: RNA prepared from HeLacells transfected with plasmid pB-HR which codes for an HIV-1-specificactive "hairpin" ribozyme.

FIG. 47A: Graph showing the effects of active and disabledHIV-1-specific ribozymes on HIV-1 expression in human HeLa cells. Theresults are presented as percent activation, a measure ofchloramphenicol acetyl transferase (CAT) activity relative to control(no ribozyme).

FIG. 47B: Graph showing the effects of active and disabledHIV-1-specific ribozymes on HIV-1 expression in human HeLa cells. Theresults are presented as percent activation, a measure of the amount ofp24 HIV-1 antigen relative to control (no ribozyme).

FIG. 48: Graph showing the specificity of the active HIV-1-specific"hairpin" ribozyme. The results are expressed as percent activity, ameasure of CAT activity relative to control (no ribozyme).

FIG. 49: Map of plasmid pdRHIV.

FIG. 50: Map of plasmid pCDLTR.

FIG. 51: Schematic representation of events occurring in HeLaT4+ cellstransfected with plasmids pdRHIV, pCDLTR and PTAT. The tat protein codedfor by PTAT is expressed and will activate expression of the CAT gene bytransactivation of the HIV-1 LTR promoter in pCDLTR. When expression ofthe ribozyme coded for by pdRHIV is induced by dexamethasone, the CATtranscript (which contains HIV-1 LTR sequences, including the ribozyme'starget sequence) will be cleaved, removing the 5' cap. As a result,production of CAT enzyme activity will be reduced.

DESCRIPTION OF THE PRESENTLY PREFERRED EMBODIMENTS

An RNA catalyst has been identified comprising an RNA sequence which canbe engineered to cut, with great precision, target RNAs having acleavage sequence. In particular, the invention comprises certainsynthetic RNA catalysts capable of cleaving an RNA substrate whichcontains the target sequence 5'-F₁ -CS-F₂ -3'.

"Synthetic RNA catalyst," as used herein, means a catalyst which is nota naturally-occurring RNA catalyst, although "synthetic catalysts" maybe truncated or altered versions of naturally-occurring catalysts."Synthetic catalysts" are catalysts designed according to the principlesset forth herein based on the "hairpin" model to bind to and cleave aselected target sequence in a selected RNA substrate. "Syntheticcatalysts" are also sometimes referred to herein as "engineeredcatalysts." "Synthetic catalysts" include catalysts synthesized in vitroand catalysts synthesized in vivo. In particular, "synthetic catalysts"include catalysts produced by hosts transformed by a vector comprising asequence coding for the catalyst.

RNA of any length and type may be used as the substrate as long as itcontains the 5'-F₁ -CS-F₂ -3' target sequence. In this formula, CS isthe cleavage sequence (i.e., a sequence of bases containing the site atwhich the catalyst cleaves the substrate). CS is a short sequence ofbases which does not base pair with the RNA catalyst, and CS preferablyhas the sequence 5'-NGUC-3', wherein N is any base, and the substrate iscleaved by the catalyst between N and G to produce a fragment having anOH at the 5' end and a fragment having a 2', 3' cyclic phosphate at the3' end.

CS is flanked by two short base sequences F₁ and F₂ which do base pairwith the RNA catalyst. F₁ is preferably at least 3 bases in length, mostpreferably 4 bases in length. F₂ is also preferably at least 3 bases inlength, most preferably 6 to 12 bases in length.

Catalysts according to the invention comprise a substrate bindingportion and a "hairpin" portion. The substrate binding portion of thecatalyst has the sequence:

    3'F.sub.4 -L.sub.1 -F.sub.3 -5'

wherein,

F₃ is a sequence of bases selected so that F₃ is substantially basepaired with F₂ when the catalyst is bound to the substrate;

F₄ is a sequence of bases selected so that F₄ is substantially basepaired with F₁ when the catalyst is bound to the substrate;

the sequences of F₃ and F₄ being selected so that each contains anadequate number of bases to achieve sufficient binding of the RNAsubstrate to the RNA catalyst so that cleavage of the substrate can takeplace; and

L₁ is a sequence of bases selected so that L₁ does not base pair with CSwhen the catalyst is bound to the substrate.

As used herein, "substantially base paired" means that greater than 65%of the bases of the two RNA sequences in question are base paired, andpreferably greater than 75% of the bases are base paired. "Substantiallyunpaired" means that greater than 65% of the bases of the two sequencesin question are not base paired, and preferably greater than 75% of thebases are not paired.

F₃ is preferably at least 3 bases in length, most preferably from 6 to12 bases in length. F₄ is preferably from 3 to 5 bases in length, mostpreferably 4 bases in length.

L₁ is a short sequence of bases which preferably has the sequence5'-AGAA-3' when CS has the sequence 5'-NGUC-3'. Further, when L₁ is5'-AGAA-3' and CS is 5'-NGUC-3', then the first base pair between F₁ andF₄ adjacent to CS and L₁ is preferably G:C or C:G (see FIG. 42D andExample 32). Accordingly, a preferred target sequence in a selectedsubstrate contains the sequence 5'-SNGUC-3', wherein S is G or C.

The "hairpin" portion is a portion of the catalyst which folds into ahairpin-like configuration when the substrate-catalyst complex ismodeled in two dimensions for minimum energy folding. This is shown inFIGS. 1 and 42D for (-)sTRSV RNA. The "hairpin" portion is not anabsolute hairpin in the sense that not all bases of the "hairpin"portion are base-paired. Indeed, it is preferable, perhaps necessary,for the "hairpin" portion to have at least one substantially unpairedregion so that the catalyst can assume a tertiary structure that allowsfor better, or optimal, catalytic activity.

The "hairpin" portion of the catalyst preferably has the sequence:##STR2## wherein, P₁ and P₄ each is a sequence of bases, the sequencesof P₁ and P₄ being selected so that P₁ and P₄ are substantially basepaired;

P₁ is covalently attached to F₄ ;

S₁ and S₂ each is a sequence of bases, the sequences of S₁ and S₂ beingselected so that S₁ and S₂ are substantially unpaired;

P₂ and P₃ each is a sequence of bases, the sequences of P₂ and P₃ beingselected so that P₂ and P₃ are substantially base paired; and

L₂ is a sequence of unpaired bases.

"Substantially base paired" and "substantially unpaired" have the samemeanings as discussed above.

P₁ and P₄ each is preferably from 3 to 6 bases in length, and mostpreferably P₁ has the sequence 5-ACCAG-3' and P₄ has the sequence5'-CUGGUA-3'. It has been found that the A at the 5' end of 5'-ACCAG-3'(underlined) is not base paired to the U at the 3' end of 5'-CUGGUA-3'(underlined), and the unpaired A may act as a "hinge" (see Example 32).It is not known yet, but the presence of such a "hinge" may be necessaryfor catalytic activity.

S₁ and S₂ each preferably is from 4 to 9 bases in length, and mostpreferably S₁ has the sequence 5'-AGAAACA-3' and S₂ has the sequence5'-GUAUAUUAC-3'.

P₂ and P₃ each is preferably from 3 to 9 bases in length, and morepreferably P₂ has the sequence 5'-CAC-3' and P₃ has the sequence5'-GUG-3'.

Finally, L₂ is preferably at least 3 bases in length and preferably hasthe sequence 5'-GUU-3'. Further, 5'-S₁ -P₂ -L₂ -3' preferably has thesequence 5'-AGAAACACACGUU -3' (SEQ ID NO 4).

The specific preferred sequences set forth above for P₁, P₂, S₁, etc.,are from the catalytic sequence of (-) sTRSV RNA.

A preferred catalyst according to the invention contains the sequence:

    5'-F.sub.3 -L.sub.1 -F.sub.4 - ACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA- 3' ,(SEQ ID NO 5)

and active variants thereof, wherein F₃, F₄ and L₁ are as defined above.As used herein "active variants" means catalysts which, although havingsubstitutions, deletions and/or additions of bases as compared to theoriginal sequence, are still capable of cleaving an RNA substrate.

The most preferred sequence for 5'-P₂ -L₂ -P₃ -3' is5'-CACGGACUUCGGUCCGUG-3' SEQ ID 46! (see Example 32). Accordingly, themost preferred catalyst has the sequence:

    5'-F.sub.3 -L.sub.1 -F.sub.4 -ACCAGAGAAACACACGGACUUCGGUCCGUG-GUAUAUUACCUGGUA-3'         SEQ ID 47!

wherein F₃, F₄ and L₁ are as defined above and the underlined portion isthe preferred sequence for P₂ -L₂ -P₃.

Another preferred catalyst according to the invention is anautocatalytic catalyst containing the sequence: ##STR3## wherein, F₁,F₂, F₃, F₄, L₁, L₂, S₁,S₂, P₁, P₂, P₃ and P₄ are as defined above. L₃ isa sequence of unpaired bases, and L₃ preferably has the sequence3'-CCUCC-5'. Thus, the molecule contains a substrate portion (5'-F₁-CS-F₂ -3') and a catalytic portion (5'-F₃ -L₁ -F₄ -P₁ -S₁ -P₂ -L₂ -P₃-S₂ -P₄ -3') covalently linked together by L₃ so as to produce asynthetic autocatalytic RNA catalyst.

After being transcribed, this catalyst will spontaneously undergo anintramolecular autocatalytic cleavage at CS. The effect of thisintramolecular cleavage is to autocatalytically terminate any RNAtranscript in which the sequence is inserted. For instance, anengineered DNA molecule may be prepared which comprises a gene ofinterest and a DNA sequence coding for the catalyst linked so that, whentranscribed, the catalyst will terminate transcription of the messengerRNA coding for the gene product at a desired location.

The invention further provides an engineered DNA molecule and a vectorcomprising a DNA sequence coding for an RNA catalyst of the invention.Also provided are host cells which have been transformed with thevectors and which are capable of expressing the RNA catalyst. Finally,the invention provides a method of cleaving an RNA substrate whichcontains the sequence 5'-F₁ -CS-F₂ -3', the method comprising contactingthe substrate with a synthetic RNA catalyst according to the invention.

The invention is further described below with particular reference tothe catalytic properties of (-)sTRSV RNA and the structure of itscatalytic complex, but the invention is not limited to syntheticcatalysts based on (-)sTRSV RNA. In addition to the particular catalyticsequences shown and described below, other RNA molecules havingcatalytic activity to cleave an RNA substrate can be readily found byapplying the principles set forth in this specification.

For example, RNA sequences having the required structural features forcleaving an RNA substrate can be identified by applying the WisconsinRNA Folding Program discussed above to (1) known sequences of moleculeshaving catalytic or autocatalytic activity (especially molecules inwhich the actual location of the catalytic site is unknown), (2)randomly generated sequences having the proper pairing regions andlengths, and (3) randomly modified known catalytic sequences, whilelooking for known features of a catalytic molecule, such as the"hairpin" configuration of the catalytic complex when modeled in twodimensions. As a specific example, information regarding knownautocatalytic cleavage sites can be used to find substrate bindingsequences having the properties described above, such as having asubstrate binding sequence adjacent a "hairpin" portion. Secondaryfeatures, such as the two substantially paired regions in the "hairpin"portion, with an intermediate substantially unpaired region and anappropriate base loop can then be looked for, either by manualexamination or with automated computer programs. In order of decreasingpreference, the following features are considered important in selectinga catalytic sequence: (1) regions that can base pair (form helices) withthe regions of the substrate RNA molecule flanking the cleavagesequence; (2) an unpaired (loop) region opposite the cleavage sequenceof the substrate RNA; (3) two substantially base paired regions in a"hairpin" structure near the cleavage sequence; (4) a substantiallyunpaired region between these two substantially paired regions in the"hairpin" structure; and (5) a loop connecting the two strands of asubstantially base paired region to complete the "hairpin" structure.Standard techniques of in vitro RNA synthesis can then be used toprepare actual molecules having the sequence that gives the predictedtwo-dimensional computer-generated structure for verification ofactivity and routine testing of variation to determine optimum sequence.

The catalysts of the present invention were developed using a "hairpin"model or motif of RNA catalysis. According to this model, the catalyticcomplex, when modeled in two dimensions for minimum energy folding,assumes a "hairpin" configuration. This is shown in FIGS. 1 and 42D for(-)sTRSV RNA. The catalytic complex is a complex of the minimum, orsubstantially the minimum, sequence of the catalyst necessary foractivity and the minimum, or substantially the minimum, target sequenceof the substrate. The "hairpin" configuration is not an absolute hairpinin the sense that not all the bases that make up this "hairpin"configuration are base-paired. Indeed, there are preferably regions ofunpaired bases and of substantially unpaired bases as discussed indetail elsewhere in the present application.

The "hairpin" model has proved very useful in designing new catalysts,but it is still only a computer model of the likely secondary structureof catalytic complexes involving catalysts according to the presentinvention. Also, it is ultimately the tertiary structure of RNAcatalysts that determines their activity. For these reasons, allcatalysts having the properties described herein are considered to comewithin the scope of the present invention, even if they do not form a"hairpin" configuration when complexed with the substrate and even ifthey do not contain a "hairpin" portion. For instance, it may bepossible to engineer a catalyst having the properties described hereinwhich does not have a loop L₂. Such a catalyst would be considered to befully equivalent to the catalysts described and claimed herein.

As described in Example 1, a catalytic complex was identified within the(-) strand of tobacco ringspot virus satellite RNA (sTRSV) when themolecule was folded using computer models to determine the minimumenergy folding in two-dimensional space. The (-) strand is a 359 baselong RNA of defined sequence and is known to have autocatalyticproperties (Gerlach, W. L., Buzayan, J. W., Schneider, I. R. andBruening, G. B. (1986) Virology, 151: 172-185; Buzayan, J. M., Gerlach,W. and Bruening, G. (1986) Nature, 323: 349-352). The (-) strand cleavesat a defined site (ApG) into a cleavage product having an OH at the 5'end and a 2', 3' cyclic phosphate at the 3' end. Up until the presenttime, however, little work had been done with the (-) strand to find thecatalytic complex and to determine the minimum cleavage sequencesbecause it does not fit the consensus "hammerhead" model.

In view of the above and the fact that the catalytic center wouldcontain both the catalytic RNA sequence and the substrate (target) RNAsequence and by studying the results of Example 1, a 50 nucleotidesequence between bases 175 and 224 was picked and a 14 nucleotidesequence between bases 40 and 53 was picked. Using methodologies foundin published procedures, a catalytic RNA having a satellite RNA basesequence identical to the base sequence in naturally-occurring (-)sTRSVbetween bases 175 and 224 was transcribed from chemically synthesizedDNA templates using T7 RNA polymerase as described in Example 2. An RNAsubstrate having a satellite RNA base sequence identical to the basesequence in naturally occurring (-)sTRSV RNA between bases 40 and 53 wasalso prepared in the same manner. When the newly synthesized RNAs weremixed together under appropriate conditions as described in Example 3,the catalytic RNA cleaved the substrate RNA. As described in Example 4,the first RNA catalyst fitting the "hairpin" motif was discovered whenthe complex of the 50-base catalytic RNA and the 14-base substrate RNAwas modeled in two-dimensional space using computer modeling.

The reaction of catalysts fitting the "hairpin" motif with anappropriate substrate proved to be an excellent catalytic reaction underphysiological conditions. The reaction of the catalyst and substratecontaining the sequences of (-)sTRSV shown in FIG. 1 gave a Km of 0.03uM (see Example 5), which is 20 times smaller than that of the Km for acatalyst fitting the "hammerhead" model (Uhlenbeck, O. C. (1987) Nature328: 596-600) and accounts for its ability to remove target RNAmolecules to much lower levels (20 times lower) than that of catalystsfitting the "hammerhead" model. In addition, the kcat for the reactionis 2.1/min at 37° C., which is at least 6 times greater than that of acatalyst having the "hammerhead" configuration at the same temperature(see Example 5). These reaction parameters for a catalyst that fits the"hairpin" model can be optimized by adjusting the amount of base pairingbetween the substrate and catalyst (see Examples 18, 21 and 32). Thebest catalytic efficiencies obtained to date are for "hairpin" ribozymesspecific for the substrate sequences UGACAGUCCUGTUU (Example 5) andUGACAGUCCUGUUUUUUU (Example 21).

Catalytic cleavage of the substrate RNA occurs over a broad pH range,preferably 5.5 to 8.0, and in the presence of divalent ions such asMg++, e.g. from MgCl₂. As would be expected for a base catalyzedreaction, the rate of reaction increased with increasing pH. Thereaction rate also increased with increasing concentration of divalentcations as shown in Example 8.

The reaction takes place at physiological temperatures, preferably 16°C. to 45° C., with a temperature optimum at 37° C. as described inExample 6. Temperatures above about 45° C. inactivate the reaction.However, the temperature optimum of the reaction is affected by thedegree of base pairing between the substrate and catalyst (see Example18). In particular, the length of the region of the catalyst that basepairs with the 3' region of the substrate flanking the cleavage sequencecan be varied so that an engineered catalyst reacting at a desiredtemperature can be obtained (see Example 18). Further, a "hairpin"catalyst which is more thermal stable than the native (-)sTRSV catalystcan be prepared by deleting the loop that closes the "hairpin" (Loop IIIin FIG. 42D) and inserting therefor the stable hairpin sequence5-GGAC(UUCG)GUCC-3' SEQ ID 1! (see Example 32). This catalyst is highlyactive at 45° C.

The 50 base catalytic RNAs configured in the "hairpin" model in FIGS. 1and 42D have the minimal sequence, or substantially the minimalsequence, necessary to achieve catalysis. When the 3' terminal A (base175; see FIG. 1) or AU (bases 175 and 176; see FIG. 1) of this sequencewas removed, catalytic activity was substantially decreased as shown inExample 10. The 5' end of the molecule cannot be substantially changedeither without affecting catalytic activity because it is needed toprovide base pairing with the substrate. It can be shortened by at mostabout 3 bases (see Example 9). Experiments which removed bases 195-203in the center of the catalytic RNA and ligated base 194 to base 204produced an inactive catalytic RNA as described in Example 11. Thisshows that all or part of the region between bases 195-203 is essentialfor catalytic activity. An additional mutagenesis experiment to test thebase requirement of the three A bases at positions 203, 202 and 201 wasdone. When these bases were changed to CGU, respectively, as describedin Example 12, the resulting catalytic RNA was inactive.

Base changes can be made in the two base paired regions (Helices 3 and 4in FIG. 42D) of the "hairpin" portion of the (-)sTRSV catalytic RNA, aslong as substantial base pairing is maintained. This is shown inExamples 22 and 32 where base changes that destroyed base pairing inthese two regions resulted in inactive catalysts. When a second basechange was made which restored base pairing, the catalytic activity wasalso restored (see FIGS. 19 and 42A-C). The one exception appears to bethe first C:G base pair at the end of Helix 3 nearest the substratebinding portion (base pair C16:G48 in FIG. 42D). Currently availableevidence indicates that the identity of these two bases must bemaintained (see Example 32).

An active ribozyme is produced when Helix 4 is extended and the sequenceof the loop that closes the "hairpin" (Loop III in FIG. 42D) is changed.As shown in FIG. 42A, Loop III was replaced with the common and verystable RNA hairpin sequence 5'-GGAC(UUCG)GUCC-3'(SEQ ID NO 1). As aresult of this substitution, Helix 4 was extended by four base pairs andthe GUU sequence of Loop III was replaced with the sequence UUCG (seeFIG. 42A). The resulting RNA catalyst was more active and, as notedabove, more thermally stable than the unmutated form (see Example 32).It was concluded from this experiment that Loop III does not have aconserved or unique base sequence and that Helix 4 can be extended by atleast four base pairs without loss of activity.

However, the simple replacement of the GUU sequence of Loop III with thesequence UUCG gives an inactive ribozyme (see Example 32). It isbelieved that this shows that the sequence of Loop III has an influenceon the stability of Helix 4.

When Loop III is cut between U31 and U32, activity is lost (see FIG.42C). A likely explanation for this is that when Loop III is cut, Helix4 opens up and catalytic activity is, consequently, lost.

Base changes can also be made in the two regions of the catalytic RNAthat base pair with the substrate, as long as substantial base pairingwith the substrate in the regions flanking the cleavage sequence ismaintained and base pairing with the cleavage sequence is avoided asshown in Examples 9, 16-21 and 32. Indeed, every base pair in Helices 1and 2 (see FIG. 42D) can be changed to any other base pair, except thebase pair in Helix 2 adjacent to the NGUC cleavage sequence (see Example32). This base pair must be G:C or C:G and cannot be A:U or U:A.However, it is believed that this base pair is needed for stability andthat A:U or U:A base pairs can be used if other measures are taken tostabilize the substrate-catalyst interaction. Indeed, it has been foundthat A:U and U:A base pairs can be used at these positions in thesynthetic autocatalytic catalyst of the invention.

It is the ability to change the base pairs in Helices 1 and 2 thatallows the RNA catalyst to be engineered to cut a specific target RNAsubstrate having a cleavage sequence such as NGUC. This is illustratedin Example 16 where the catalytic RNA was engineered by changing base214 from a G to a C resulting in a catalytic RNA which failed to reactwith the substrate RNA developed from natural (-)sTRSV RNA. Activity wasrestored, however, when the substrate RNA was changed so that it couldbase pair with the subject engineered catalytic RNA. Also see Examples9, 17-21 and 32.

A type of "hinge" region, consisting of a single base, seems to bepresent between Helices 2 and 3 of the (-)sTRSV catalytic RNA (see FIG.42D and Example 32). The A at position 15 is not paired to the U atposition 49, and the unpaired A may act as a "hinge."

Further mutation studies have showed that the substantially unpairedregions between Helices 3 and 4 (Loops II and IV in FIG. 42D) are largerthan originally predicted by computer modeling and energy minimization(compare FIGS. 1 and 42D). Further, some of the bases in these loopsappear to be required for activity (see Example 32). In particular,changing bases C25 and A43 or deleting A26 in Loop II results in a lossof activity (see FIG. 42C and Example 32; also see Example 12). Also, inLoop IV, changing bases A38→G, A38→U, A40→G, A40→U, U42→C, and A43→U anddeleting U42 produced inactive ribozymes, as did the double mutationA43→U and U37→A (see Example 32).

The (-)sTRSV catalytic RNA sequence has an 5'-AGAA-3' sequence oppositethe AGUC cleavage sequence of the substrate. As shown in Examples 24 and32, at least part of this AGAA sequence is invariant. In particular,when the A's at the 3' and 5' ends of the sequence (AGAA) were changed,the resulting catalysts were active (see FIG. 42A). However, when G or A(AG A) in the center of the sequence was changed, the resultingcatalysts were inactive (see FIGS. 21 and 42C).

The target RNA substrate of the "hairpin" catalytic complex shown inFIG. 1 has an AGUC cleavage loop which does not base pair to thecatalytic RNA in two-dimensional space. As shown in Examples 13-15, "A"in the AGUC cleavage sequence can be changed to any other base withouteffecting the ability of the RNA catalyst to cleave the substrate.Example 25 shows that substrates having the GUC sequence are 6-100 timesmore susceptible to cleavage than are substrates in which one of thesebases has been changed. Thus, substrates having the cleavage sequenceNGUC are highly preferred.

Although there is no base pairing between the NGUC cleavage sequence andthe AAGA sequence in the catalyst opposite from the cleavage sequence,the C in NGUC apparently interacts with the A in AAGA opposite to it. Inparticular, it has been found that the single mutation of C→A in thecleavage sequence NGUC (position 9 in the substrate sequence--see FIG.42B) destroyed the ability of the RNA catalyst to cleave the substrate.When a second mutation was made in the ribozyme, changing the A atposition 7 to a C, partial activity was restored (see Example 32),indicating some sort of interaction between C at position 9 in thesubstrate and the A at position 7 in the catalyst (see FIG. 42B).

The cleavage sequence has four flanking bases at its 5' end and six atits 3' end which base pair with the catalytic RNA. As described above,the bases in the flanking regions can be changed (with the possibleexception of the G:C base pair in Helix 2 adjacent to NGUC) withoutaffecting the ability of the catalytic RNA to cleave the substrate, aslong as sufficient base pairing with the catalyst is maintained in theflanking regions. This would be expected to work on RNA substratesequences of any length as long as these criteria are met. Indeed,lengthening the 3' region of the substrate that base pairs with thecatalyst has been found to provide a more efficient catalytic reaction.See Examples 18, 21 and 32. However, a smaller 10 base substrate havingthree flanking bases at its 5' end and three flanking bases at its 3'end did not work as well as the 14-base substrate, as described inExample 9.

Using the "hairpin" model as a guide, RNA catalysts can be engineeredthat base pair with an RNA substrate and mediate a cleavage in thecleavage sequence. In particular, catalytic RNA can be engineered thatwill cleave any RNA substrate having a cleavage sequence, such as NGUC,and flanking regions with which the catalyst base pairs, so that thecatalytic RNA and RNA substrate form a catalytic complex in a "hairpin"motif. To do this, the bases flanking the cleavage sequence must beidentified and the catalytic RNA engineered so that it does not pair intwo-dimensional space with the cleavage sequence but does pair withadequate numbers of flanking bases upstream and downstream of thecleavage sequence. When designing a synthetic catalyst based on the(-)sTRSV catalyst, the other principles set forth herein regardingconserved or preferred sequences should also be taken into account indesigning the synthetic catalyst.

As shown in the Examples, catalytic RNAs according to the invention cancleave specific viral and messenger RNA sequences. In Example 15,tobacco mosaic virus (TMV) RNA containing the 5' coding region of thereplicase gene was targeted for specific cleavage by an appropriatelyengineered catalytic RNA. The target sequence contained changes in 8 ofthe 14 bases of the substrate RNA having a base sequence found withinthe catalytic complex of (-)sTRSV RNA and was cleaved by the engineeredRNA catalyst under conditions near physiological. Catalytic RNAs werealso designed and synthesized using the "hairpin" model as a guide whichcould cleave sequences from messenger RNA coding for chloramphenicolacetyl transferase (Example 18) and from HIV-1 viral RNA (Examples 19,20, 29, 33 and 34). In particular, a conserved sequence in HIV-1 viralmessenger RNAs has been identified, and a "hairpin" catalytic RNAdesigned which cleaves this sequence (see Example 29).

These examples demonstrate that the system can be used to specificallycleave an RNA sequence for which an appropriately engineered catalyticRNA base pairs at the designated flanking regions. Suitable target RNAsubstrates include viral, messenger, transfer, ribosomal, nuclear,organellar, other cellular RNA, or any other natural RNA having acleavage sequence, as well as RNAs which have been engineered to containan appropriate cleavage sequence.

Catalysts that fit the "hairpin" catalytic RNA model are useful in vivoin prokaryotes or eukaryotes of plant or animal origin for controllingviral infections or for regulating the expression of specific genes. Inthis case, a cleavage sequence such as NGUC in the virus orcomplementary to NGUC in the gene would need to be identified along withthe flanking sequences immediately upstream and downstream of thecleavage sequence. Normally three to four bases on the 5' side of thecleavage sequence and enough bases in the order of 6 to 12 on the 3'side to provide adequate binding of the catalytic RNA and to providereasonable certainty that the target RNA sequence is unique in theorganism are required.

A catalytic RNA is then engineered which does not base pair with thecleavage sequence and which does base pair to the flanking regions onthe 5' and 3' side of the cleavage sequence. A DNA templatecorresponding to this catalytic RNA is then synthesized using proceduresthat are well-known in the art. Such procedures include thephosphoramidite method (see, e.g., Beaucage and Caruthers, TetrahedronLetters, 22, 1859 (1981); Matteucci and Caruthers, Tetrahedron Letters,21, 719 (1980); and Matteucci and Caruthers, J. Amer. Chem. Soc., 103,3185 (1981)) and the phosphotriester approach (see, e.g., Ito et al.,Nucleic Acids Res., 10, 1755-69 (1982)).

This invention also encompasses catalytic compounds containing multiplehairpin ribozymes targetted to a particular gene or viral genome. Suchmultiple ribozymes would be particularly well suited for viral uses, forexample HIV-1. One particularly suitable multiple ribozyme would targetboth the HIV-1 leader sequence (111/112. site) and the HIV-1 polsequence (sequence ID Nos. 42 and 43).

The invention also includes an engineered DNA molecule and a vectorcomprising a DNA sequence coding for the desired synthetic RNA catalyst.The vector will have the DNA sequence coding for the desired catalyticRNA operatively linked to appropriate expression control sequences.Methods of effecting this operative linking, either before or after theDNA coding for the catalyst is inserted into the vector, are well known.Expression control sequences include promoters, activators, enhancers,operators, stop signals, cap signals, polyadenylation signals, and othersignals involved with the control of transcription.

The vector must contain a promoter and a transcription terminationsignal, both operatively linked to the synthetic DNA sequence, i.e., thepromoter is upstream of the synthetic DNA sequence and the terminationsignal is downstream from it. The promoter may be any DNA sequence thatshows transcriptional activity in the host cell and may be derived fromgenes encoding homologous or heterologous proteins and eitherextracellular or intracellular proteins, such as amylase, glycoamylases,proteases, lipases, cellulases, and glycolytic enzymes. Also, a promoterrecognized by T7 RNA polymerase may be used if the host is alsoengineered to contain the gene coding for T7 RNA polymerase.

The promoter may contain upstream or downstream activator and enhancersequences. An operator sequence may also be included downstream of thepromoter, if desired.

Expression control sequences suitable for use in the invention are wellknown. They include those of the E. coli lac system, the E. coli trpsystem, the TAC system and the TRC system; the major operator andpromoter regions of bacteriophage lambda; the control region offilamentous single-stranded DNA phages; the expression control sequencesof other bacteria; promoters derived from genes coding for Saccharomycescerevisiae TPI, ADH, PGK and alpha-factor; promoters derived from genescoding for the Asierqillus oryzae TAKA amylase and A. nigerglycoamylase, neutral alpha-amylase and acid stable alpha-amylase;promoters derived from genes coding for Rhizomucor miehei asparticproteinase and lipase; mouse mammary tumor promoter; SV40 promoter; theactin promoter; and other sequences known to control the expression ofgenes of prokaryotic cells, eukaryotic cells, their viruses, orcombinations thereof.

The vector must also contain one or more replication systems which allowit to replicate in the host cells. In particular, when the host is ayeast, the vector should contain the yeast 2 u replication genes REP1-3and origin of replication.

The vector should further include one or more restriction enzyme sitesfor inserting the DNA template sequences into the vector, and preferablycontains a DNA sequence coding for a selectable or identifiablephenotypic trait which is manifested when the vector is present in thehost cell ("a selection marker").

Suitable vectors for use in the invention are well known. They includeretroviral vectors, vaccinia vectors, pUC (such as pUC8 and pUC4K), pBR(such as pBR322 and pBR328), pTZ (such as pTZ18R), pUR (such as pUR288),phage lambda, YEp (such as YEp24) plasmids, and derivatives of thesevectors.

The resulting vector having the engineered DNA sequence that codes forthe RNA catalyst is used to transform an appropriate host. Thistransformation may be performed using methods well known in the art.

Any of a large number of available and well-known host cells may be usedin the practice of this invention. The selection of a particular host isdependent upon a number of factors recognized by the art. These include,for example, compatibility with the chosen expression vector, toxicityto it of the catalytic RNA encoded for by the engineered DNA sequence,rate of transformation, expression characteristics, bio-safety andcosts. A balance of these factors must be struck with the understandingthat not all hosts may be equally effective for the expression of aparticular catalytic RNA.

Within these general guidelines, useful hosts include bacteria (such asE. coli sp.), yeast (such as Saccharomyces sp.) and other fungi,insects, plants, animals (including human), or other hosts known in theart.

As an example of the general genetic engineering techniques that arepossible, mammalian vectors can be used to deliver DNA coding for thecatalytic RNAs of the invention to animal cells. These vectors shouldhave a suitable DNA replication signal, such as from SV40, a promoterwhich may or may not be inducible, such as the mouse mammary tumorpromoter (which is induced by dexamethasone) or the noninducible SV40promoter. A multiple cloning site is located after the promoter, and theDNA coding for the engineered catalytic RNA is inserted into thismultiple cloning site using standard techniques. If necessary a suitableterminator is inserted. The resulting vector is then put into cellsusing standard techniques such as direct injection into the nucleus,electroporation, or other well-known transformation techniques. Once thevector is in the cell, the catalytic RNA is expressed directly whennoninducible promoters are used, or after addition of the inducer wheninducible promoters are used.

Similarly, plant vectors, such as the Ti plasmid or micro-Ti plasmids,can be used to deliver DNA coding for a desired catalytic RNA to plantcells. The Ti plasmids and micro-Ti plasmids may be used as such totransform plant protoplasts using known techniques or may be insertedinto Agrobacterium tumefaciens which is then used to transform planttissue. Once the plasmid is in the cell, the catalytic RNA will beexpressed.

Once transformed, a host cell can express (transcribe) the synthetic RNAcatalyst. When the DNA coding for the catalyst is transcribed, itproduces catalytic RNA which will attack and cleave the target RNAsequence or sequences for which it has been designed, inactivating theRNA. If the RNA is necessary for the life cycle of a virus, the viruswill be eliminated and if the RNA is the product of a specific gene, theexpression of that gene will thus be regulated. The catalytic RNA can bedesigned to work in prokaryotes and within the nucleus (without poly(A)tail) or in the cytoplasm of a eukaryotic cell (with polyadenylationsignals in place) for plants and animals.

Another potential method of using the catalytic RNAs of the invention isto prepare stable synthetic derivatives of RNA catalysts designed tobind and cleave a specific target RNA and to deliver the modifiedcatalysts directly to the cell or organism of choice. For example,standard methods are available for making phosphorothioate derivativesof DNA which have been shown to be very stable in vivo and to be able tobind to a specific DNA or RNA target in vivo (antisense method). Amodification of these procedures can be used to prepare a catalyticallyactive derivative of RNA catalysts prepared according to the invention.This would entail determining which ribonucleotide regions can bealtered and then altering them with deoxy, phophorothio, or othermodifications which confer stability but do not destroy catalyticactivity. This chemically modified catalytic RNA (which may or may nothave any remaining RNA bonds) can then be injected or otherwisedelivered to an organism to control viruses or gene expression. Forinstance, one of the catalytic RNAs whose preparation is described inExamples 19-20 having specificity for sequences found within the RNA ofthe HIV-1 virus that causes AIDS could be chemically modified asdescribed, encapsulated in a liposome coated with monoclonal antibodydirected to the CD4 receptors found on cells susceptible to HIV-1, andinjected into a host animal.

The "hairpin" catalytic RNA model of the present invention may also beof possible interest to molecular biologists in exploring the geneticblueprints of plants and animals. This would be done by sending randomlyconstructed DNA reverse transcripts of catalytic RNA into the DNA of theorganism and waiting to see which gene or genes were inactivated. Othertechniques could be applied to determine where those genes resided onthe organism's chromosomes, thereby greatly accelerating gene mapping.

Finally, a synthetic autocatalytic RNA catalyst has been developed. Thesynthesis of one such catalyst based on the (-)sTRSV RNA catalytic andsubstrate sequences is described in Example 23. When the DNA coding forthis catalyst was transcribed in vitro, autocatalytic cleavage wasobserved. Further, in Example 26, DNA coding for this catalyst waslinked to the gene coding for chloramphenicol acetyl transferase (CAT)in vector pHC-CAT. This vector was then used to transform Escherichiacoli. Northern blot analysis of the RNA produced by the E. coli hostsindicated that autocatalytic cleavage occurred in vivo under standard E.coli growth conditions. Similar results were obtained when mammalian andplant cells were transformed with a vector coding for this autocatalyticRNA linked to either the gene coding for CAT (mammalian cells) or to DNAcoding for cauliflower mosaic virus (plants) (see Examples 27 and 28).The autocatalytic RNA catalyst has also been used to properly terminateother engineered RNA catalysts after they were transcribed. In thismanner, the engineered RNA catalysts were liberated from the RNAtranscript and acted in trans in vivo (see Example 31).

EXAMPLES

The following examples further illustrate the invention.

Example 1

The (-) sense sequence of satellite RNA from the budblight strain oftobacco ringspot virus as shown in FIG. 2 was folded using the WisconsinRNA folding program to identify the location of a possible catalyticcomplex accounting for its ability to self cleave (University ofWisconsin Genetics Computer Group, Program FOLD 5/6/86) (Zucker, M. andStiegler, P. (1981) Nucleic Acids Res., 9: 133-148; Devereux, J.,Haeberli, P. and Smithies, O. (1984) Nucleic Acids Res., 12: 387-395).Base numbers correspond to (+)sTRSV RNA (Buzayan, J. M., Gerlach, W. L.,Bruening, G. B., Keese, P. and Gould, A. R. (1986) Virology, 151:186-199). With this numbering scheme the 5'-3' direction of the moleculeis with decreasing base number.

The minimum catalytic complex, or active site of the molecule, isidentified in FIG. 2. The folding identified regions of expected basepairing which are in classical double-helical or stem regions. Thefolding also identified expected non-base pairing loops at or near thesite of cleavage. This model does not preclude higher order interactionsoccurring between non-adjacent portions of the catalytic center.

Example 2

A 50 nucleotide sequence between bases 175 and 224 was picked and a 14nucleotide sequence between bases 40 and 53 was picked from thecatalytic complex identified in Example 1. A catalytic RNA (R51) withthe 50 base sequence shown in FIG. 1 plus one additional vector base (Gat the 5' end) and a substrate RNA (S17) with the 14 base sequence shownin FIG. 1 plus three vector bases (GCG at the 5' end) were transcribedusing T7 RNA polymerase from synthetic DNA templates double stranded atthe promoter site (Milligan, J. F., Groebe, D. R., Witherell, G. W. andUhlenbeck, O. C. (1987) Nuc. Acids Res., 15: 8783-8798). The syntheticDNA templates were made using phosphoramidite chemistry on an AppliedBiosystems 381A DNA synthesizer. The template DNAs were: catalytic RNAR51:3'-ATTATGCTGAGTGATATCTTTGTCTCTTCAGTTGGTCTCTTTGTGTGCAACACCATATAATGGACCAT-5'(SEQID NO 8) and substrate RNA S17(SEQ ID NO 10):3'-ATTATGCTGAGTGATATCGCACTGTCAGGACAAA-5'(SEQ ID NO9).

Before transcription, a 18mer or 16mer DNA complement to the promoterfor T7 RNA polymerase on the noncoding strand was hybridized by heatingan equimolar amount of template DNA with promoter complement to 65° C.for 3 min. then placing in ice. A typical transcription reaction used 8ng/ul DNA template, 0.5 mM each NTP, 2 mM spermidine, 40 mM Tris pH 7.5,4% polyethylene glycol 6,000, 6 mM MgCl₂, 4 mM NaCl, 10 mMdithiothreitol, 0.01% Triton X-100, 2.4 units/ul RNasin, 1.8 uCi/ul P³²CTP and 3 units/ul T7 RNA polymerase (U.S. Biochemical) and was run at37° C. for 90 min.

All in vitro transcribed RNAs were isolated on 7M urea, 15-20%acrylamide gels, bands cut out and isolated. All RNAs were sequencedusing standard methods (Donis-Keller, H., Maxam, A. M. and Gilbert, W.(1980) Nucleic Acids Res., 4: 2527-2538); a method which also gave the5' terminal base. Terminal bases at the 3' end were determined byligation of the RNA to 5, P³² pCp using T4 RNA ligase (BRL methodsmanual), nuclease T2 digestion, and separation of labelled bases by PEIthin layer chromatography in 0.3M LiC1 with appropriate standards. AllRNA sequences corresponded to that expected from the DNA template.

Example 3

The catalytic RNA R51 was added to the substrate RNA S17 at a ratio of1:30 and the time course of substrate RNA cleavage was studied. Thereaction was carried out at 37° C. in 12 mM MgCl₂, 40 mM Tris pH 7.5 and2 mM spermidine over a time period of 150 min and is summarized asfollows: ##STR4##

The reaction products were separated on polyacrylamide/urea gels byelectrophoresis and bands cut out and counted in a liquid scintillationcounter. The results are shown in FIGS. 3A-3B. The time periods analyzedwere: lane 1, 30 sec; lane 2, 5 min; lane 3, 15 min; lane 4, 30 min;lane 5, 60 min; lane 6, 90 min and lane 7, 150 min. Beginningconcentrations were as follows: R51=0.0032 uM and S17=0.09 uM.

As shown in FIGS. 3A-3B, the cleavage proceeds to virtual completionduring the course of the reaction with only 2% of the substrateremaining after 150 minutes. This shows that, since there was originally30 times as much substrate RNA as catalytic RNA, the RNA catalyst R51 ofnecessity interacts with multiple substrate molecules during the courseof the reaction. In addition, the amount of catalyst remained the sameand was unaltered, indicating that R51 is truly a catalytic entity.

Example 4

After the RNA catalyst had been shown to be effective in cleaving theRNA substrate as described in Example 3, minimum energy folding of the50 base sequence shown in FIG. 1, complexed with the 14 base sequencewas done using the computer methods described in Example 1. The foldedcomplex forms a "hairpin" model or motif as shown in FIG. 1 with thesubstrate RNA sequence and the catalytic RNA sequence identified. Thearrow is at the site of cleavage.

Example 5

Various concentrations of substrate S17 were used at constantconcentration of catalyst R51 and initial velocities of each reactiondetermined. The reaction was carried out at 37° C. in 12 mM MgCl₂, 40 mMTris pH 7.5 and 2 mM spermidine. Concentrations were as follows:R51=0.0004 uM and S17=0.125 uM (lane 1), 0.0624 uM (lane 2), 0.0417 uM(lane 3), 0.031 uM (lane 4), 0.021 uM (lane 5), 0.0156 uM (lane 6),0.0078 uM (lane 7) and 0.0039 uM (lane 8). Each reaction was analyzed onpolyacrylamide gels as described in Example 3 with the results shown inFIGS. 4A-4B. An Eadie Hofstee plot of catalytic RNA R51 cleavage ofsubstrate RNA S17 is shown in FIGS. 4A-4B. The reaction proceededaccording to the predictions of the Michaelis-Menten equation indicatingthat it was a truly enzymatic reaction in that as the concentration ofthe substrate goes down, the velocity of the reaction goes down. Fromthe velocity of the reaction plotted as a function of substrateconcentration, the Km calculated from the graph was 0.03 uM and the kcat(turnover number) was 2.1/min. This Km is 20 times smaller than the"hammerhead" catalysts (see Uhlenbeck, O. C. (1987) Nature, 328:596-600) indicating that lower concentrations of substrate can beremoved. The kcat is 6 times larger than that of catalysts fitting the"hammerhead" model at 37° C. (see Uhlenbeck, O. C. (1987) Nature, 328:596-600), meaning that the reaction is 6 times faster.

Example 6

The temperature dependence of the rate of cleavage of substrate RNA S17by catalytic RNA R51 was tested over a temperature range and thereaction products analyzed on polyacrylamide gels as described inExample 3 with the results shown in FIGS. 5A-5B. The reaction wascarried out in 12 mM MgCl₂, 40 mM Tris pH 7.5 and 2 mM spermidine at 45°C. (lane 6), 41° C. (lane 5), 37° C. (lane 4), 33° C. (lane 3), 27° C.(lane 2) and 20° C. (lane 1). The concentrations used were: R51=0.0016uM and S17=0.04 uM. R51 was unlabeled. The velocities shown in the graphin FIGS. 5A-5B were calculated by the use of time points of 8 and 16minutes. The separation patterns on the gel shown in FIGS. 5A-5B are forthe 16-minute time point.

The reaction showed a temperature dependence similar to that which wouldbe expected of a reaction involving base paired RNA molecules. TheArrhenius plot of the data shown in FIGS. 5A-5B gives a temperatureoptimum of 37° C. for the reaction. Higher temperatures reduce thereaction rate with a very rapid rate reduction above 41° C. consistentwith a melting out of the catalytic RNA structure. At 50° C. no reactionwas detectable. The reaction rate at temperatures below 37° C. showed alinear reciprocal temperature dependence consistent with a classicallowering of the energy of activation for the reaction. The slope of theline in the Arrhenius plot gave an energy of activation of 19 Kcal/molewhich is close to that found for catalysts fitting the "hammerhead"cleavage mechanism (13.1 Kcal/mole) (Uhlenbeck, O. C. (1987) Nature,328: 596-600).

Example 7

The rate of cleavage of a constant concentration of substrate RNA S17 atvarying concentrations of catalytic RNA R51 was tested and the reactionproducts analyzed on polyacrylamide gels as described in Example 3 withthe results shown in FIGS. 6A-6B. The reaction was carried out at 37° C.in 12 mM MgCl₂, 40 mM Tris pH 7.5 and 2 mM spermidine for 40 min (lane 1and 2), 20 min (lane 3), 10 min (lane 4) and 5 min (lane 5). Theconcentration of substrate was 0.175 uM. The results are plotted inFIGS. 6A-6B and show that at saturating concentrations of substrate thereaction rate is linear with increasing RNA catalyst concentrations asone would expect for a true catalytic reaction.

Example 8

The effect of Mg⁺⁺ concentration and pH on the rate of cleavage of RNAsubstrate S17 by RNA catalyst R51 was determined as shown in thefollowing table:

    ______________________________________               t.sub.1/2  (min)    ______________________________________    MgCl.sub.2 (mM)    0            no detectable product    4            136    6            111    8            115    10           88    12           81    15           74    20           62    pH    5.5          330    6.0          120    6.5          67    7.0          48    7.5          42    8.0          38    ______________________________________

In the Mg⁺⁺ studies, the substrate S17 concentration was 0.14 uM and RNAcatalyst R51 concentration was 0.0015 uM. The reactions were at 37° C.in 40 mM Tris pH 7.5. In the pH studies, the substrate S17 concentrationwas 0.062 uM and RNA catalyst R51 concentration was 0.0014 uM. Thereactions were at 37° C. in 40mM Tris for pH 7.0, 7.5, 8.0 and in 40mMPipes for pH 5.5, 6.0 and 6.5.

The dependence of the reaction rate on Mg⁺⁺ and pH are virtuallyidentical with those of catalysts fitting the "hammerhead" model. Thereaction rate increases with increasing pH as one would expect for abase catalyzed reaction but the effect is masked by the catalyticactivity of the RNA. Hence a 100 fold increase in OH! between pH 6.0 and8.0 resulted in only a 3 fold increase in the reaction rate.

Example 9

A 10 base substrate (S10) was prepared by the methodology of Example 2.When the substrate was mixed with catalytic RNA R51, the reaction issummarized as follows: ##STR5##

The results of rate studies with substrate S10 comparable to thosedescribed with S17 in Example 5 showed a Km=0.06 uM and a kcat=0.8/min.These results are shown in FIG. 7 and indicate that smaller substratescan be used, but not as efficiently.

Example 10

The 3' terminal base of the catalyst shown in the "hairpin" model of the(-)sTRSV catalytic complex in FIG. 1 is at position 175. Two catalyticRNAs were prepared with exactly the same sequence as R51, except thatone of them did not contain the 3' terminal "A" base (position 175) andthe other one did not contain the 3' terminal "UA" sequence (positions176 and 175). Synthesis of these catalytic RNAs, designated R50 and R49,respectively, was carried out as described in Example 2. R50 or R49catalytic RNA was mixed with substrate RNA S17 under standard conditionsof reaction and the products analyzed as described in Example 3. Theresults are given in FIGS. 8A-8B and show a 75% reduction in activitywith either R50 or R49 as compared to the activity of catalytic RNA R51having the 3' terminal "A" at position 175 and the 3' terminal "AU" atpositions 175 and 176.

This experiment was repeated, and Km and kcat values measured. It wasfound that the activity (kcat/Km) of R50 was 5 times lower than that ofR51, and the activity of R49 was 20 times lower than that of R51. The Kmvalues of R50 and R49 were both in the 30-50 nM range, with the decreasein activity being almost entirely in the kcat values.

Replacement of the terminal A with G reduced activity 75% as compared toR51, which is the same reduction in activity seen when this base isremoved. Replacement of U49 with A gave a fully active ribozyme ascompared to R51.

Example 11

An RNA with the same sequence as catalytic RNA R51 was prepared, exceptthat bases 195-203 were omitted such that base 194 was in effect ligatedto base 204. This RNA molecule was prepared as described in Example 2from an appropriate DNA template containing the complementary sequence.When this RNA was mixed with substrate RNA S17 as described in Example3, no reaction occurred. These results show that major elements of the"hairpin" structure are required for RNA catalysis to occur and thatremoval of 9 bases (see FIG. 9) in the middle inactivates the catalyticRNA.

Example 12

An RNA with the same sequence as R51, except that the bases AAA atpositions 203, 202 and 201 were changed to CGU, respectively, wasprepared as described in Example 2 using an appropriate DNA primer. Whenthis potential RNA catalyst was mixed with substrate RNA S17 asdescribed in Example 3, no reaction occurred. This shows that theintegrity of one or all of these bases (see FIG. 10) is required forcatalytic activity.

Example 13

A substrate RNA with the base at position 49 in FIG. 1 changed from an"A" to a "G" was prepared as described in Example 2. When this substratewas reacted with the RNA catalyst R51, no difference in rate of reactionwas seen between this substrate and the substrate containing the "A" atposition 49 (see FIGS. 11A-11B). This shows that alterations can occurin the "A" base in the substrate RNA AGUC loop without affecting theability of the catalytic RNA to cleave the substrate.

Example 14

Another substrate RNA identical to S17 but having "A" replaced by "U" inthe AGUC loop was prepared as described in Example 2 (designated"S17(A→U)"). This substrate RNA, at a concentration of 0.12 uM, wasreacted with the catalytic RNA R51, at a concentration of 0.0065 uM,under standard conditions as described in Example 3 for 60 minutes. Theresults are shown in FIGS. 11C-11D where Lane 1 contains the products ofthe reaction of substrate RNA S17(A→U) with R51 catalytic RNA. Nodifference in the rate of reaction was seen between S17(A→U) substrateRNA and substrate S17 containing the "A" base at position 49.

Example 15

Another substrate RNA identical to S17 but having "A" replaced by "C" inthe AGUC loop was prepared as described in Example 2 (designated"S17(A→C)"). This substrate RNA, at a concentration of 0.08 uM, wasreacted with the catalytic RNA R51, at a concentration of 0.0065 uM,under standard conditions as described in Example 3 for 60 minutes. Theresults are shown in FIGS. 11E-11F where Lane 1 contains the products ofthe reaction of S17 (A→C) substrate RNA with R51 catalytic RNA. Again,no difference was seen in the rate of reaction using S17(A→C) ascompared to S17 containing the "A" base at position 49. The combinedresults of Examples 13-15 show that the base at position 49 in thecleavage sequence of the substrate may be any base.

Example 16

Base changes in the stem regions at the site of binding of the substrateRNA to the catalytic RNA in the "hairpin" configuration can be made aslong as the base pairing is maintained. The "C" base at position 50 ofthe substrate was changed to a "G" using the methods in Example 2. Whenthis substrate RNA was reacted with the catalytic RNA R51, no cleavageof this substrate occurred. However, when a new catalytic RNA,containing a "C" at position 214, rather than the "G" found in R51, wassynthesized according to the methods in Example 2 and added to thissubstrate, full cleavage was seen. The effect of the base change from"C" to "G" in the substrate was to eliminate the base pairing at thisposition predicted by the "hairpin" model since now a "G" would beacross from a "G". However, when a "G" to "C" base change was made inthe catalytic RNA, the base pairing was restored, but in a reversemanner, and the integrity of the helices in the stem regions where thesubstrate RNA binds to the catalytic RNA was thus conserved restoringcatalytic activity (see FIG. 12).

Example 17

An RNA sequence found within the sequence of tobacco mosaic virus wassynthesized using the methods described in Example 2. This synthesizedtarget RNA had the sequence 5'gAAACAGUCCCCAAC 3'(SEQ ID NO 11). Acatalytic RNA was synthesized with the sequence5'-GUUGGGAGAAGUUUACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA-3 (SEQ ID NO 12)selected so that base pairing between the substrate and the catalyticRNA is maintained in the "hairpin" configuration (see FIGS. 13A-13B).When these two RNAS were mixed under standard catalytic conditions asdescribed in Example 3, the target was cleaved demonstrating that asequence found within a native viral RNA can be cleaved.

Example 18

Three RNA sequences found within the sequence of the messenger RNA forthe enzyme chloramphenicol acetyl transferase (CAT) were synthesizedusing the methods described in Example 2. The synthesized substrate RNAshad the sequences (A) gUUUCAGUCAGUUGC; (B) gUUUCAGUCAGUUGCUC; and (C)gggUUUCAGUCAGUUGCUCAA (SEQ ID NOS 13-15) (see FIGS. 14A-14F).

Note that the latter two sequences are extensions of the first sequenceand that additional bases have been added to the 3' end in the regionthat the "hairpin" model predicts will base pair with the catalytic RNAto form Helix 1 (see FIG. 18). Also, substrate (C) had two additional Gvector bases as compared to substrates (A) and (B). The site of cleavageafter the A in the AGUC cleavage sequence (see the arrow in FIGS.14A-14F) of the substrates corresponds to position 320 of the CAT geneusing the number system found in the Tn9 sequence (Alton and Vapnak,Nature, 282, 864 (1979)). In FIGS. 14A-14F, the open boxed bases arethose which are different from those in the native (-)sTRSV substrateRNA sequence shown in FIG. 1.

Catalytic RNAs corresponding to substrate RNAs (A), (B) and (C) weresynthesized according to the methods described in Example 2. Theirsequences were designed so that they would base pair with the substrateRNAs in both the 3' and 5' regions flanking the AGUC cleavage sequence.In addition, the catalytic RNAs designed to react with substrate RNAs(A) and (B) each contained the vector sequence GA at their 5' terminus,and the catalytic RNA designed to react with substrate RNA (C) containedthe vector sequence GGG at its 5' terminus. Otherwise, the catalyticRNAs had the same sequence as the (-)sTRSV catalytic RNA sequence shownin FIG. 1.

The various substrate and catalytic RNAs were reacted and the reactionproducts analyzed as described in Example 3. All reaction conditionswere as described in Example 3, except for the following. For substrates(A) and (B), reaction conditions were: substrate RNA concentration =0.05uM; catalytic RNA concentration =0.005 uM; reaction run at 16° C.; andreaction time of 20 minutes. For substrate (C), the reaction conditionswere the same as for (A) and (B), except that the reaction time was 40minutes and temperatures were 20° C., 25° C., 30° C. and 37° C.

Cleavage of all of the substrate RNAs occurred when they were mixed withthe corresponding catalytic RNAs as is shown in FIGS. 14A-14F,demonstrating that catalytic RNAs according to the invention can besynthesized which cleave specific RNA sequences found within a messengerRNA. In addition, this example demonstrates that cleavage of substrateRNA can be obtained even though the length of the region at the 3' endof the substrate which base pairs with the catalyst (i.e. the portionthat forms Helix 1 with the substrate according to the "hairpin" model)is varied. Indeed, when the length of this region was extended to 10bases in substrate (C), the reaction could then proceed at 37° C.,whereas for substrates (A) and (B) having shorter sequences in thisregion, the reaction would proceed only at 16° C.

Example 19

An RNA substrate corresponding to part of the sequence of HIV-1, thevirus which causes AIDS, was synthesized as described in Example 2. Thesequence of this substrate RNA is shown in FIG. 15A. The arrow in FIG.15A shows the cleavage site which corresponds to position 804 in thesequence of HIV-1 strain SF2CG (Sanchez-Pescador et al., Science, 227,484 (1985). The sequence shown is found in the region of the viral RNAwhich specifies the gag protein. The RNA substrate also had a GCG 5'vector sequence. The open boxed bases in FIG. 15A are those which aredifferent than those of the native (-)sTRSV substrate sequence shown inFIG. 1.

A catalytic RNA was synthesized according to the methods of Example 2.Its sequence was designed so that it would base pair with the substrateRNA in both the 3' and 5' regions flanking the CGUC cleavage sequence.In addition, the catalytic RNA contained the vector sequence GGG at its5' terminus. Otherwise, the catalytic RNA had the same sequence as the(-)sTRSV catalytic RNA sequence shown in FIG. 1.

The catalytic RNA and the substrate RNA were reacted and the reactionproducts were analyzed as described in Example 3. The reactionconditions were as set forth in Example 3, except that the followingtemperatures were used: 20° C., 25° C., 30° C. and 37° C. Also, thereaction was run for 60 minutes, except for 37° C. which was run for 15minutes, and the substrate RNA concentration was 50 nM, and thecatalytic RNA concentration was 5 nM.

The catalytic RNA cleaved the substrate at the expected position betweenthe "C" and "G" in the CGUC cleavage sequence found in the loop betweenthe two flanking stem regions. Thus, a specific sequence found in theHIV-1 viral RNA that codes for the gag protein can be cleaved with acatalytic RNA according to the invention. The reaction occurred with anRNA substrate having a 16-base target sequence which was longer than S17in the region at the 3' end which base pairs with the catalyst (ie., theportion that forms Helix 1 with the catalytic RNA according to the"hairpin" model). Also, the reaction occurred at physiologicaltemperature of 37° C.

Example 20

A substrate RNA having a sequence found in the beginning of the codingregion for the regulatory protein tat of HIV-1 virus was synthesized asdescribed in Example 2. The substrate sequence is shown in FIG. 16A. Inaddition to the HIV-derived sequence, the substrate RNA had a GCG 5'vector sequence. The open boxes are around bases which are differentthan those of the native (-)sTRSV substrate sequence shown in FIG. 1.

A catalytic RNA having a sequence so that it would base pair with thesubstrate RNA in the two regions flanking the UGUC loop (ie., theregions that forms Helices 1 and 2 with the catalytic RNA according tothe "hairpin" model) was also synthesized as described in Example 2.Otherwise, the catalytic RNAs had the same sequence as the (-)sTRSVcatalytic RNA sequence shown in FIG. 1, except that it had an additional5'G vector base.

The substrate RNA and catalytic RNA were reacted and the reactionproducts were analyzed as described in Example 3. Reaction conditionswere: 37° C.; reaction times of zero and 15 minutes; the concentrationof substrate RNA was 100 nM; and the concentration of catalytic RNA was20 nM.

Cleavage occurred as expected between the "U" and the "G" in the UGUCcleavage sequence located between the two stem regions of the substrate.The cleavage site is indicated by the arrow in FIG. 16A. This isposition 5366 in the sequence of HIV clone h9c (Muesing et al., Nature,313, 450 (1985)). These results again confirm that a catalytic RNAdesigned according to the "hairpin" model can cleave a specific targetsequence located in a naturally occurring RNA, in this case a keyregulatory region (tat) of the HIV-1 virus which causes AIDS.

Example 21

Using the methods described in Example 2, a substrate RNA having fournon-native bases (UUUU) added to the 3' end of the sequence of thenative (-)sTRSV substrate shown in FIG. 1 and a corresponding catalyticRNA designed to base-pair with the substrate RNA in the 3' and 5'regions of the substrate flanking the cleavage sequence (i.e., theportions that form Helices 1 and 2 with the catalyst according to the"hairpin" model) were made. Thus, the substrate RNA contained a total of18 bases, including a four-base extension of the 3' region that basepairs with the catalyst. The substrate RNA also had an additional GCGvector sequence at the 5' end. The catalyst had the same sequence as the(-)sTRSV catalytic RNA sequence shown in FIG. 1, except that it had fouradditional AAAA bases at the 5' end designed to base pair with the addedUUUU bases of the substrate and had an additional "G" vector base at the5' terminus beyond the 18 base recognition site.

The substrate and catalytic RNAs were reacted at standard conditions andthe reaction products were analyzed as described in Examples 3 and 5.Catalytic RNA concentration was 0.00033 uM, and substrate RNAconcentration was 0.1 uM (Lane 1), 0.05 uM (Lane 2), 0.033 uM (Lane 3),0.025 uM (Lane 4), 0.016 uM (Lane 5), 0.012 uM (Lane 6), 0.006 uM (Lane7), and 0.003 uM (Lane 8). The results are shown in FIGS. 17A-17B.Cleavage rates at each concentration of substrate were determined bycutting out the bands, counting radioactivity in a liquid scintillationcounter and plotting the data using Michaelis-Menton procedures tocalculate Km and kcat (see Example 5).

The data show that an extension of the length of the region of basepairing between the substrate and catalyst (i.e., those regions of thecatalyst and substrate that form Helix 1 according to the "hairpin"model) can improve the catalytic properties of the reaction. Cleavage ofthe 18-base RNA substrate occurred at the expected site, but at anincreased rate as compared to the cleavage of S17 by R51. The kcat orturnover number of the reaction was 7/minute. This means that eachmolecule of catalytic RNA cleaved 7 molecules of substrate RNA perminute during the reaction. The kcat for S17 cleavage by R51 was2.1/minute. The Km of the reaction was the same as for S17 cleavage byR51 (30 nM). This shows that by optimizing the length of the region ofthe catalyst that base pairs to the substrate in the 3' region flankingthe cleavage sequence (ie., by optimizing the length of Helix 1predicted by the "hairpin" model), the catalytic properties of thenative reaction can be improved.

Example 22

A series of catalytic RNAs were prepared using the methods described inExample 2 having certain base substitutions as compared to the native(-)sTRSV catalytic RNA sequence shown in FIG. 1. The substitutions,which are illustrated in FIG. 19A, were the following: (1) At nucleotide35, G was replaced by C (G35→C) (count bases from the 5' end of thecatalytic RNA sequence shown); (2) A double mutant, having the G atposition 35 replaced by C and the C at position 27 replaced by G (G35→C;C27→G); (3) At nucleotide 47, G was replaced by C (G47→C); and (4) Adouble mutant having the G at position 47 replaced by C and the C atposition 17 replaced by G (G47→C; C17→G). All catalytic RNAs had anadditional "G" vector base at their 5' end.

The various catalytic RNAs were reacted with substrate S17 and thereaction products were analyzed as described in Example 3. The resultsare shown in FIG. 19B where Lanes 1, 3, 5, 7, 9 and 11 are at zero time,and Lanes 2, 4, 6, 8, 10 and 12 are 15 minutes incubation under standardcleavage conditions (see Example 3). The concentration of catalytic RIIAwas 0.0065 uM, and the concentration of substrate RNA was 0.17 uM. Thetemperature was 37° C. The control, lanes 1, 2, 7 and 8, was cleavage ofthe native substrate RNA S17 by catalytic RNA R51 which has the native(-)sTRSV sequence (see FIG. 1).

As shown in FIG. 19A, the catalytic RNA G35→C had no catalytic activity(see Lanes 3 and 4 of FIG. 19B, where this catalytic RNA is designated"mismatch" since the base substitution at position at position 35results in a loss of base pairing). The double mutant catalytic RNAG35→C; C27→G showed restored catalytic activity (see Lanes 5 and 6 ofFIG. 19B where this catalytic RNA is designated "substitute b.p." sincethe net effect of the two base substitutions is to create a base pair,but a base pair different than the one found in the native (-)sTRSVcatalytic RNA). The catalytic RNA G47→C was also inactive (see Lanes 9and 10 of FIG. 19B), while the double mutant, with the second mutationC17→G, showed restored activity (see Lanes 11 and 12 of FIG. 19B).

The results of these mutagenesis studies provide confirmation for theexistence of Helices 3 and 4 (see FIGS. 19A-19B) of the "hairpin"catalytic RNA model proposed herein. The results show that whenmutagenesis was carried out which caused a base pair mismatch in theregion of proposed base pairing, the catalytic RNA was inactive.However, when a second mutation was carried out so that the base pairwas restored, catalytic activity was restored, even though the new basepair was different than the original base pair. Such results areconsidered evidence for the existence of a helix (Fox and Woese, Nature,256, 505 (1975). These results also show that a variety of catalyticRNAs having sequences different from the native (-) sTRSV sequence arecatalytically active if they are designed so that they follow certainrules derived from the "hairpin" model, such as maintenance ofsubstantial base pairing in regions of predicted helices.

Example 23

A synthetic "hairpin" autocatalytic cassette was prepared. The RNAsequence of the cassette is shown in FIG. 20A. Several bases have beenadded at the 5' end of the catalyst as compared to the native (-)sTRSVsequence shown in FIG. 1. The effect is to close the open end of the"hairpin" and to provide a substrate sequence (i.e., a cleavage sequenceand upstream and downstream flanking regions) which can base pair withthe substrate binding portion of the catalyst.

The cassette was prepared by making an appropriate synthetic DNAtemplate that would yield an RNA with the sequence shown in FIG. 20A andusing the DNA template to transcribe RNA as described in Example 2, withthe following differences. After synthesizing the DNA template, it wasinserted into the vector pTZ18R (U.S. Biochemical) into which a newmultiple cloning site had been inserted. The new multiple cloning sitewas a construct containing sites, in 5'→3' order, for cleavage by thefollowing enzymes: BamHI, XhoI, APaI, SacII, NaeI, StuI, KpnI, MluI. Thenew multiple cloning site was inserted into vector pTZ18R by cleavingthe vector with BamHI and SalI and then ligating the multiple cloningsite to the vector using T4 ligase. Located 5' to the inserted multiplecloning site is the T7 RNA polymerase promoter. The vector containingthe multiple cloning site was digested with MluI and SalI, and the DNAtemplate coding for the autocatalytic cassette was ligated into thevector using T4 ligase. The resultant vector was then linearized withHindIII, and transcription carried out as described in Example 2. Allrestriction enzymes and the T4 ligase were obtained from IBI and usedaccording to manufacturer's instructions.

After being transcribed, the cassette spontaneously underwent anintramolecular autocatalytic cleavage at the expected site to give theappropriate 3' F and 5' F products (see FIGS. 20A-20B). Note that theeffect of this is to autocatalytically terminate a transcript. Forexample, the 5' F is in itself a transcript which has been terminated atits 3' end by the autocatalytic reaction. Further note that thistermination only leaves five essential bases at the 3' end of this 5' F.These are UGACA which are boxed in FIG. 20A. Thus, it is possible tovery efficiently terminate transcription with this autocatalyticcassette and leave only a very short 3' end to the transcript (ie., the5' fragment of the autocatalytic cleavage).

Example 24

A catalytic RNA was prepared as described in Example 2. Its sequence wasidentical to that of R51, except that the base at position 217 in theAGAA loop was changed from a G to a C. The AGAA loop of the catalyst isopposite the cleavage sequence of the substrate when the substrate andcatalyst are complexed (see FIG. 1)

Another catalytic RNA was prepared as described in Example 2. Itssequence was also identical to R51, except that the base at position 216was changed from A to U.

These catalysts were reacted with the RNA substrate S17 under standardconditions as described in Example 3 at 37° C. for 0 and 15 minutes. Thecontrol was the reaction of substrate S17 with catalyst R51. Theconcentration of the three catalytic RNAs was 0.007 uM, and theconcentration of substrate RNA was 0.7 uM.

The reaction products were analyzed as described in Example 3, and theresults are shown in FIGS. 21A-21B, where the first lane in each gel iszero time and the second lane is 15 minutes of reaction. As shown inFIGS. 21A-21B, changes in either one of these two bases (G217→C andA216→U) in the loop opposite the cleavage sequence of the substratedestroyed the activity of the catalyst. These results indicate thatthese two bases are likely invariant in the native (-)sTRSV catalyticsequence.

A third catalytic RNA was prepared as described in Example 2. Itssequence was identical to R51, except that the base at position 218 waschanged from A to C. This catalyst was also reacted with substrate S17as described above. Cleavage of the substrate with this catalyst wasobserved, but only at about 47% of the level achieved with R51 (data notshown).

Example 25

Substrate RNAs identical to S17, but with one of the bases at positions46, 47 or 48 (i.e. bases GUC of the cleavage sequence) changed to adifferent base, were prepared as described in Example 2. When thesesubstrates were reacted with the RNA catalyst R51 under standardconditions for 60 minutes as described in Example 3, the followingresults were obtained:

    ______________________________________    Mutation   Activity (% of activity with S17)    ______________________________________    G48->A     0    G48->C     0    U47->G     2    U47->A     7    U47->C     15    C46->G     1-2    C46->A     1-2    C46->U     1-2    ______________________________________

This shows that U47 and C46 are not absolutely essential for cleavage ofthe substrate. However, activity was 6-100 times greater when thecleaveage sequence was GUC than when U47 and C46 were changed to otherbases. The results of these experiments combined with the results ofExamples 13-15 show that the cleavage sequence of (-)sTRSV RNA is mostpreferably NGUC, where N is any base.

Example 26

The vector prepared in Example 23 (hereinafter referred to as "pHC")containing the "hairpin" autocatalytic cassette was tested for activityin vivo as follows. First, the CAT gene was removed from plasmidpMAMNEO-CAT (purchased from Clontech Inc.) with SmaI and XhoI. It wasthen ligated using T4 ligase into pHC which had been cut with SmaI andXhoI to produce vector pHC-CAT.

The original vector used in these constructions (pTZ18R; see Example 23)contains an inducible promotor (lacZ) and, as a result of the stepsdescribed in Example 23 and immediately above in this example, the CATgene and the "hairpin" autocatalytic cassette were placed in thisinducible region (see FIG. 22A). Also, the CAT gene and "hairpin"autocatalytic cassette were linked so that the expected transcript wouldbe as shown in FIG. 22A (the "CAT-cassette RNA") and so that the RNAsequence coded for by the "hairpin" autocatalytic cassette would beexpected to serve as a chain terminator for the CAT transcript bycleaving at the indicated cleavage site (see FIG. 22A). The expected 5'fragment of such a cleavage is also shown in FIG. 22A ("CleavedCAT-cassette RNA").

Next, pHC-CAT was transfected into Escherichia coli strain JM109 (widelyavailable from a number of commercial sources and from the American TypeCulture Collection) with calcium chloride and heat shock using standardprocedures as described in Maniatis et al., Molecular Cloning (1983).Transformed cells containing pHC-CAT were selected on the basis ofampicillin resistance by plating on double concentration YT mediumcontaining 100 ug/ml ampicillin. After selection, the transformed cellswere grown overnight in LB broth at 37° C. A fresh culture of thesecells was then grown for 5 hours in LB broth at 37° C., after which thecells were induced with 1 mM isopropyl-beta-D-thiogalactoside (IPTG) forone hour to allow expression of the lacZ region, including the CATgene-cassette RNA transcript.

At the end of this time, RNA was isolated by incubating the cells in 50mM Tris, pH 8.0, 50 mM ethylene-diaminetetraacetic acid (EDTA), 1 mg/mllysozyme at room temperature for 10 minutes to lyse the cells. Thelysate was made to 0.5% sodium dodecyl sulfate (SDS), and thencentrifuged to remove cell debris. Phenol was added to the supernatantat a ratio of 1:1, and the supernatant was centrifuged to remove theprecipitate. This procedure was repeated, and the resulting aqueousphase was treated with an equal volume of isopropanol at -20° C. for 20minutes to precipitate the RNA. The precipitate was collected bycentrifugation, dried, and redissolved in water.

This isolated RNA was electrophoresed on 1.2% agarose gels containingformaldehyde as described in Current Protocols In Molecular Biology(Greene 1989). After electrophoresis, Northern blots were carried outusing published methods (GeneScreen Plus, DuPont, July 1985). Two DNAprobes were used for blotting the gels. The CAT probe was prepared byprimer extension of the CAT gene using the Klenow fragment of DNApolymerase I and DATP labelled with alpha P³². The "hairpin"autocatalytic cassette probe was prepared by kinasing the DNA complementto the entire "hairpin" autocatalytic cassette RNA sequence shown inFIGS. 20A-20B with DATP labelled with gamma P³². The expected positionsof binding of these probes to the CAT-cassette RNA and expected 5'fragment are shown in FIG. 22A.

The results of the Northern blot test are shown in FIGS. 22B-22C. Asshown there, when the CAT probe was used, the full length CAT-cassetteRNA transcript and the expected 5' fragment were detected on the gel,indicating that cleavage had taken place in vivo.

When the "hairpin" autocatalytic cassette probe was used, only the fulllength CAT-cassette RNA transcript was detected (see FIGS. 22B-22C). Thefact that the 5' fragment did not hybridize with this probe was to beexpected, since most of the "hairpin" autocatalytic cassette transcriptwould be in the 3' fragment after cleavage. Although it would beexpected that the "hairpin" autocatalytic cassette probe would hybridizeto the 3' fragment, the fact that the 3' fragment was not detected byNorthern blot analysis is not surprising. The 5' terminus of the 3'fragment would contain a 5'-OH and not the 5'-ppp which is ordinarilyseen in RNA transcripts. Thus, the 3' fragment would be expected to bevery labile in vivo and was likely degraded immediately after theautocatalytic cleavage.

The mobilities of the CAT-cassette RNA and the Cleaved CAT-cassette 5'fragment on the gel corresponded exactly to the predicted length of thetranscripts. The standards run on the gel were E. coli 16S and 23S RNA.

An important control was to determine whether cleavage of the target RNAoccurred during isolation of the RNA. Intact, uncleaved "hairpin"autocatalytic RNA prepared and separated on an acrylamide gel asdescribed in Example 23 was isolated from the gel using conditionssimilar to those described above (i.e., no divalent cation, but in thepresence of EDTA and SDS). Only intact, uncleaved RNA was obtained whenthe catalyst was re-electrophoresed on an acrylamide gel.

Example 27

Mammalian expression vector pMHC-CAT containing the "hairpin"autocatalytic cassette linked to the CAT gene was constructed asfollows. First, the "hairpin" auto-catalytic cassette was removed fromthe vector pHC (prepared as described in Example 23) by digesting pHCwith SmaI/SalI and ligating the resulting fragment to the mammalianvector pMSG (purchased from Pharmacia) at the SmaI/XhoI sites to givepMHC. This operation removed the XhoI site used to clone the "hairpin"autocatalytic cassette insert, but introduced another XhoI site near the5' end of the insert.

Next, the CAT gene was removed from pMAM-NEO-CAT (Clontech, Palo Alto,Calif.) with SmaI/XhoI and cloned into pMHC at the SmaI/XhoI sites togive the final vector pMHC-CAT which is shown in FIG. 23. Note that theCAT gene and the "hairpin" autocatalytic cassette are both driven by theMMTV LTR promoter which is dexamethasone inducible.

The vector PMHC-CAT was co-transfected into Chinese hamster ovary cells(CHO) by the calcium phosphate method along with the vector pMSG-dhfr (agift from Dr. Ed Prochownick of the University of Michigan, Ann Arbor,Mich.) which contained a mutant dhfr gene. The isolation of this mutantdhfr gene is described in Simonson and Levinson, Proc. Nat'l Acad. Sci.80, 2495-99 (1983), and pMSG is available from Pharmacia (see above).

Transfectants were selected using gpt medium (Molec. & Cell. Biol., 3,1421-29 (1983)) which kills nontransfected cells. They were nextamplified with methotrexate (which amplifies the mutant dhfr gene andassociated DNA) to give CHO cells which contained the "hairpin"autocatalytic cassette linked to the CAT gene stably integrated into thegenome. Dexamethasone (1 uM) was added to induce expression of the CATgene and "hairpin" autocatalytic cassette, and RNA was isolated from theindividual cells by the guanidinium isothiocyanate procedure (CurrentProtocols in Molecular Biology).

Next, a nuclease protection assay was performed to identify the natureof the transcripts and cleavage products, if any, produced bytranscription of the "hairpin" autocatalytic cassette-CAT gene fusion.The probe used in the S1 nuclease assay was prepared from the vectorpHC19R. This vector was prepared by removing the "hairpin" autocatalyticcassette from pHC with BamHI/SalI and ligating this fragment into theBamHI/SalI sites of the vector pTZ19R (purchased from US Biochemical) asshown in FIG. 24. Vector pHC19R was linearized with BamHI, transcribedwith T7 RNA polymerase using P³² -CTP as described in Example 2, and thetranscript was isolated from 7M urea gels, also as described in Example2. This RNA probe is 148 nt long and is the complement to-the "hairpin"autocatalytic cassette RNA.

The probe was then hybridized in buffer (40 mM Pipes, pH 6, 100 mM NaCl,20% formamide, 1 mM ZnSO₄) to RNA (5 ug) isolated from CHO cells thathad been transfected with vector pMHC-CAT, and the hybridized RNA wasdigested with S1 nuclease (100 units, 1 hr., 37° C.). S1 nuclease willdigest all single-stranded RNA, but not double-stranded RNA. A negativecontrol was RNA from transfected CHO cells which had not been induced. Apositive control was RNA isolated from the pHC plasmid which had beentranscribed in vitro with T7 RNA polymerase as described in Example 2.

The S1 digests were electrophoresed on 10% acrylamide-7M urea gels. Theresults are shown in FIG. 25 where: Lane 1 contains the RNA probe alone;Lane 2 contains the probe digested with S1 nuclease; Lane 3 contains invitro transcribed "hairpin" autocatalytic cassette RNA, 5' F and 3' Fthat have been hybridized to probe and digested with S1 nuclease(positive control); Lane 4 contains RNA isolated from CHO cells that hadbeen transfected with PMHC-CAT but were not induced by dexamethasone,hybridized to the probe and digested with S1 nuclease; and Lane 5contains RNA from CHO cells that had been transfected with pMHC-CAT andinduced with dexamethasone, hybridized to the probe and digested with S1nuclease. All mobilities were as expected, and, as can been seen in Lane5, RNA transcribed from the "hairpin" autocatalytic cassette-CAT genefusion was cleaved in vivo into the expected products.

Another control was uncleaved "hairpin" auto-catalytic cassette RNAhybridized to the probe, S1 nuclease digested and electrophoresed underthe same conditions. No cleavage products were seen (data not shown).This control shows that cleavage of the "hairpin" autocatalytic cassetteRNA did not occur as a result of the analysis conditions and, therefore,that the cleavage products seen in Lane 5 must have been produced as aresult of in vivo cleavage of the hybrid CAT-"hairpin" autocatalyticRNA.

Example 28

The following example illustrates in vivo activity of the "hairpin"catalytic RNA in plants. The "hairpin" autocatalytic cassette (seeExample 23) was ligated to cauliflower mosaic virus (CMV) in a viralvector. Plants were then transformed with the resulting vector and,during replication of the virus, the "hairpin" autocatalytic cassetteRNA cleaved the viral RNA intramolecularly. The viral RNA serves as atemplate for viral replication and for attenuation of the virus. Sincecleavage levels of the "hairpin" autocatalytic cassette were about50-60% in vitro, attenuated viral infection in these plants would beexpected if the "hairpin" autocatalytic cassette RNA linked to the CMVRNA did cleave in vivo in the plants, and this was what was observed.

The constructions tested had the "hairpin" autocatalytic cassette in thesense and antisense orientation. They are shown in FIG. 26.

These constructions were made by removing the "hairpin" autocatalyticcassette from vector pHC (prepared as described in Example 23) withXhoI/SalI and ligating this fragment into the unique XhoI site of CMVplasmid pCS101 (a gift from Dr. Art Hunt, University of Kentucky).Plasmid pCS101 contains the entire CMV sequence. The XhoI site islocated in gene II of CMV and was chosen because DNA can be cloned intothis site without subsequently interfering with plant infection. Anotherplasmid identical to pCS101, except having a pBR322 bacterial replicon,is available from American Type Culture Collection, and can be used inits place as a source of CMV. The "hairpin" autocatalytic cassette couldbe inserted in either of two orientations (sense or antisense) sinceXhoI and SalI ends are compatible.

The resulting constructions were grown in XL-1 blue E. coli(Stratagene), and clones of the "hairpin" autocatalytic cassette insertwere isolated and identified by cleaving with KpnI and byelectrophoresing the insert on 1% agarose gels. The isolated clones werepCS101HC7 and pCS101HC9 which had the "hairpin" autocatalytic cassettein the sense and antisense orientation, respectively. Theseconstructions are shown in FIG. 26. Construction pCS101HC7 was designedso that CMV 35S RNA containing the "hairpin" autocatalytic cassettewould be transcribed from it and cleaved if the "hairpin" autocatalyticcassette was active in vivo.

After being grown in E. coli, the constructions were cut with SalI, andwere rubbed onto 10 turnip (Brassica campestis) plants (0.5 ugDNA/plant) which were three weeks old. Four groups of plants weretreated as follows:

A. Control--mock inoculated (no virus)

B. Virus control--no "hairpin" autocatalytic cassette insert--pCS101

C. Virus with the sense "hairpin" autocatalytic cassette--pCS101HC7

D. Virus with the antisense "hairpin" autocatalytic cassette--pCS101HC9

The plants were checked weekly for viral symptoms. Within six weeks,plants treated with wild-type virus were observed to have viralsymptoms, including vein clearing, leaf wrinkling and a yellow mosaicpattern on the leaf. In turnips treated with pCS101HC7, the onset ofsymptoms was delayed 7-10 days as compared to turnips infected withwild-type virus, and the severity of the symptoms never reached thelevels attained with the wild-type virus. The following was observedwhen the plants were nine weeks old:

    ______________________________________    Treatment       Symptoms    ______________________________________    A               No symptoms    B               Viral symptoms, worst case,                    most yellowing (chlorosis)                    and most lesions.    C               Viral symptoms, distinct                    chlorosis and lesions,                    but much less than in the                    virus control.    D               Viral symptoms, less than                    viral control, but not                    as much yellowing as the                    virus with the sense                    "hairpin" autocatalytic                    cassette. Some plants                    looked uninfected.    ______________________________________

These results indicate that a "hairpin" catalytic RNA according to theinvention can cleave viral RNA in which it is inserted so that the viralinfection is attenuated (Treatment C). Surprisingly, viral attenuationwas also obtained with the "hairpin" autocatalytic cassette linked tothe viral DNA in the antisense direction (Treatment D). This is believedto be due to the deletion or disabling of the viral construct containingthe antisense catalyst, so that viral replication is diminished comparedto virus control (Treatment B).

Next, RNA was isolated from the leaves of the nine-week old plants, aprobe was prepared and hybridized to the isolated RNA, S1 nucleasedigestion was carried out, and the S1 digests separated on 10%acrylamide-7M urea gels and analyzed by autoradiography, all asdescribed in Example 27. The probe used was P³² -labelled autocatalyticcassette RNA. The results are shown in FIG. 27A, where the 3' F and 5' Fresulting from in vivo RNA cleavage were found only in plants infectedwith pCS101HC7 having the "hairpin" autocatalytic cassette inserted inthe sense direction (see Lane 5). None of the other control plants,including those infected with the antisense construct pCS101HC9, showedthese cleavage products, (see Lanes 3, 4 and 6).

RNA isolated from the leaves of the nine-week old plants was alsosubjected to Northern blot analysis. Total RNA was prepared by theguanidium thiocyanate method (Chirgwin et al., Biochemistry, 18, 5294-99(1979)) followed by pelleting through a 5.7M CsCl step gradient. Then,2.5 ug of total RNA was denatured at 65° C. for 5 min. with formaldehydeand formamide. The denatured RNA was then electrophoresed on a 1%agarose/formaldehyde gel, and the RNA was transferred to a Duralon-UVnylon filter (Stratagene) in 10× standard sodium citrate (SSC) (0.15M).After UV crosslinking, the filter was prehybridized in 6× SSC and 0.05×BLOTTO (Sambrook, Fritsch and Maniatis, Molecular Cloning § 1.102 (2nded. 1989)) for 3 hrs at 68° C. The "hairpin" autocatalytic cassettefragment was labeled with P³² using the oligolabeling procedure ofFeinberg and Vogelstein, Anal. Biochem., 12, 6-9 (1983). Then it washeated to 100° C. for 5 min. and cooled on ice. This probe was added tothe prehybridization mix, and incubation was continued at 68° C.overnight. The nylon filter was washed once in 2× SSC, 0.1% SDS for 20min. at room temperature, followed by washing with 1× SSC, 0.1% SDS for60 min. at 68° C. and 0.1× SSC, 0.1% SDS for 60 min. at 68° C. Thefilter was then exposed overnight to Kodak X-OMAT film using twolightning plus screens. The results are shown in FIG. 27B. Of interestis the detection of viral RNA transcripts of about the size expectedbefore cleavage (8 Kb) and after cleavage (6 Kb and 2 Kb) followinginfection with pCS101HC7 (Lane 3).

Some reproducible size heterogenity was observed in the S1 fragments andin the Northern blots. This may be due to the rapid degradation by plantnucleases of these RNA fragments which contain either a 2'-3' cyclicphosphate or a 5' hydroxyl after ribozyme cleavage. The 5' hydroxylcould mimic naturally occurring RNA degradation signals. After infectingturnips with pCS101HC7, RNAs of the sizes expected before and after RNAcleavage were detected, but the amount of the 6 Kb fragment detected wassomewhat less than the 2 Kb fragment. This could be due to enhancedstability of 2 Kb transcript which contained a cyclic phosphate over the6 Kb transcript which contained the 5' hydroxyl.

To determine if the DNA encoding the "hairpin" autocatalytic ribozymewas stable in vivo, total DNA was isolated from the leaves ofmock-inoculated turnips, turnips inoculated with wild-type CMV (pCS101)and turnips inoculated with pCS101HC7. DNA was isolated from plantsessentially as described in Murray and Thompson, Nucleic Acids Research,8, 4321-25 (1980). Oligonucleotide primers homologous to domains 5' and3' to the XhoI site of the parental CMV clone pCS101 were used asprimers to amplify DNA sequences between these domains by polymerasechain reaction (PCR). Primer 1, which includes an EcoRI site, hybridizes63 bases upstream from the XhoI site and has the following sequence:

    5'-GGAATTCACC CGTCAGTTTT TAATACTGC-3'                       SEQ ID 16!

Primer 2 includes a BamHI site and hybridizes 54 bases downstream fromthe XhoI site and has the following sequence:

    5'-TGGATCCATT CTAGTATRTMG AGCTTCT-3'                        SEQ ID 17!

The primers were synthesized on an Applied Biosystem 391 PCR-MATE usingphosphoramidite chemistry. PCR was performed as described by the vendorof TaQ polymerase (Perkin-Elmer). Briefly, PCR conditions were 94° C.for 1 min., 55° C. for 2 min. and 72° C. for 3 min. for 35 cycles. AfterPCR amplification, the amplified DNA was size fractionated on a 2%agarose gel and stained with ethidium bromide.

The results are shown in FIG. 27C. When total plant DNA frommock-infected turnips was fractionated, no bands were observed (FIG.27C, lane 1). Total DNA isolated from plants infected with wild-type CMVgave an expected band of about 123 bp (FIG. 27C, lane 2). DNA isolatedfrom plants infected with pCS101HC7 gave an expected band of 225 bp(FIG. 27C, lane 3). In the latter two cases, the amplified DNA was thesame size as those bands amplified using the intact plasmids pCS101 andpCS101HC7 (FIG. 27C, lanes 4 and 5). These results indicate that atleast the majority of the DNA coding for the "hairpin" autocatalyticribozyme was retained intact in the viral genome after infection.

Finally, protein extracts were prepared from plants that weremock-inoculated, inoculated with wild-type CMV and inoculated withpCS101HC7, and the levels of CMV coat protein in the extracts determinedby Western immunoblot. The extracts were prepared by homogenizing turnipleaf tissue in an equal volume of phosphate buffered saline using amortar and pestle. Samples were boiled for 10 min. and then spun down ina microfuge for 10 min. at 4° C. Protein concentrations of thesupernatants were determined by the method of Bradford, Anal. Biochem.,72, 248-54 (1976), and the extracts were diluted with an equal volume of4× Laemmli sample buffer and denatured at 100° C. for 5 min. (Laemmli,Nature, 227, 680-85 (1970)). The samples were next electrophoresed on10% SDS-polyacrylamide gels with 5% stacking gel (id.), followed byelectrotransfer of the proteins to nitrocellulose at 45 volts, 1.25hours, in transfer buffer as described in Towbin et al., Proc. Natl.Acad. Sci. USA, 76, 4350-54 (1979). Prestained molecular weight markers(BRL) were used to confirm protein transfer. Nonspecific antibodybinding was avoided by using blocking solution (Johnson et al., GeneAnal. Techn., 1, 3-8 (1984)). All antibody incubations were done at 22°C. with gentle agitation. The primary antibody was rabbit antiserum toCMV coat protein (1:1000 dilution in blocking solution) provided by Dr.R. Shepherd, University of Kentucky. An anti-rabbit IgG alkalinephosphatase conjugate (Sigma, St. Louis) was used as the secondaryantibody (1:1000 dilution in blocking solution). Visual detection ofproteins was accomplished using BCIP/NBT (Sambrook et al., MolecularCloning: A Laboratory Manual (2nd ed. Cold Spring Harbor Laboratory1989)).

The results are presented in FIGS. 27D and 27E. No coat protein wasobserved in extracts from mock-inoculated plant tissues one or twomonths after infection (FIGS. 27D and 27E, lane 1). In extracts fromplants inoculated with wild-type virus, coat protein was found insignificant quantities at one month and two months post infection (FIGS.27D and FIG. 27E, lane 2). One month after infection with the ribozymeconstruct pCS101HC7, very little coat protein was detected (FIG. 27D,lane 3). Two months after infection with pCS101HC7, increased levels ofcoat protein approaching those found after wild-type CMV infection weredetected (FIG. 27E, lane 3).

The combined results of the pathological observations and the S1nuclease, Northern blot, PCR amplification and Western immunoblot assaysprovide conclusive evidence that the "hairpin" catalytic RNA cleavedviral RNA in vivo in plants.

Example 29

A highly conserved sequence in HIV-1 viral RNA has been discovered whichhas favorable properties as a potential target site for a suitablyengineered "hairpin" catalytic RNA. The 16-base sequence is shown inFIG. 28. Cleavage occurs between the two bases found at positions111/112 as counted from the 5' cap site (all HIV-1 sequences are fromthe HIV Sequence Data Base, prepared and distributed by Gerald Myers etal., Los Alamos National Laboratory, Los Alamos, N. Mex., telephone(505) 665-0480).

When various HIV-1 isolates were compared, only two isolates showedvariations in the sequence, and the variations were, in each case, onlya single base change (see FIG. 29) (sequences shown in FIG. 29 are fromthe HIV Sequence Data Base). The conserved target site sequence is foundin the 5' leader region of all nine HIV-1 mRNAs (see FIG. 28). Thus, acatalytic RNA engineered to cleave the conserved target sequence shouldcleave all of these mRNAs and, thereby, prevent or reduce the spread ofthe virus.

A "hairpin" catalytic RNA was designed according to the principles setforth herein to cleave the conserved target sequence. The catalytic RNAsequence, designated herein as "RHIV", is shown in FIG. 30. RHIV wasdesigned so that it would base pair with the target sequence in the tworegions flanking the CGUC cleavage sequence (see FIG. 30). The "hairpin"portion of RHIV has the same sequence as the "hairpin" portion of the(-)sTRSV catalytic RNA sequence shown in FIG. 1. Bases 3'-CUGAGGG-5' atthe 5' end of RHIV are vector bases and are not necessary for activityof the catalyst. RHIV was synthesized as described in Example 2.

A substrate RNA containing the conserved 16-base target site sequenceshown in FIG. 28 plus additional GCG vector bases at its 51 end was alsosynthesized as described in Example 2. This substrate RNA is designatedherein as "SHIV". The sequence of SHIV is shown in FIG. 30.

The substrate RNA SHIV and catalytic RNA RHIV were then reacted understandard conditions as described in Example 3 over a period of time from0 to 280 minutes. The concentration of substrate SHIV was 0.025 uM, andthe concentration of RHIV was 0.005 uM. Cleavage of substrate SHIV(which contains the conserved HIV-1 sequence) by the engineered"hairpin" catalytic RNA RHIV was obtained (see FIGS. 31A-31B).

The kinetics of the reaction between SHIV and RHIV were investigatedusing the methods described in Example 5. The results are shown in FIGS.32A-32B. The time of incubation was 5 minutes, and the concentration ofRHIV was 0.005 uM. The concentration of SHIV was: Lane 1--0.10 uM; Lane2--0.05 uM; Lane 3--0.025 uM; Lane 4--0.012 uM; Lane 5--0.006 uM; andLane 6--0.025 uM (this is the control lane at zero time).

The rate of cleavage (turnover number, kcat) was determined to be1.6/minute (see FIGS. 32A-32B). This is a very rapid rate when comparedto any other catalytic RNA under the mild conditions of temperature, pHand salt concentration used. The Km was determined to be 100 nM (seeFIGS. 32A-32B), which is also very small for an RNA catalyst under theseconditions. The low Km indicates that the reaction proceeds very well atextremely low concentrations of substrate.

Next, the ability of the catalytic RNA RHIV to cleave a long HIV-1transcript was tested. The RNA used as a target sequence was a 183nt-long transcript containing 107 nt of HIV-1 sequence. The targettranscript was made by T7 RNA polymerase transcription (as described inExample 2) of the HaeII-linearized plasmid pROS. This plasmid wasobtained from Dr. John Rossi of the City of Hope Medical Center, Duarte,Calif. Plasmid pROS has the 5'-HindIII fragment of the HIVHXB2 isolatecloned into plasmid pBluescript II (Stratagene, LaJolla, Calif.). The5'-HindIII fragment of HIVHXB2 contains the sequence (+)77 to (+)635from the 5' cap site (sequence obtained from the HIV Sequence DataBase). The 183 nt transcript contains 76 nt of vector sequence at the 5'end, followed by 107 nt of HIV-1 sequence (the sequence from (+)77 to(+)183 from the 5' cap site).

This 183 nt transcript (5 nM) was incubated with catalytic RNA RHIV (25nM) under standard conditions for 60 minutes as described in Example 3.When the products were analyzed on 6% acrylamide-7M urea gels, cleavagewas seen to have occurred (see FIGS. 33A-33B). All mobilities in FIG. 33were as expected. This result shows that RHIV, a "hairpin" catalytic RNAaccording to the invention, can cleave a long HIV-1 RNA transcript invitro under conditions near physiological for mammalian cells.

Example 30

As a prerequisite for in vivo experiments, it was necessary to develop asystem for the proper termination of the "hairpin" catalytic RNA RHIVwhen it was transcribed in vivo. Such a system was developed by cloningDNA coding for RHIV into the BamHI/MluI sites of the vector pHC(prepared as described in Example 23) to give plasmid pHR (see FIGS.34A-34B). In plasmid pHR, DNA coding for RHIV is 5' to DNA coding forthe autocatalytic cassette RNA (see FIGS. 20A-20B).

Plasmid pHR was transcribed with T7 RNA polymerase and the resulting 101nt RNA (see FIGS. 34A-34B) isolated, all as described in Example 2. Thecatalytic activity of this 101 int RNA transcript (designated herein as"PRHIV") was assayed against the substrate SHIV (see FIG. 30) asdescribed in Example 3. The concentration of catalytic RNA (RHIV orPRHIV) was 0.025 uM, and the concentration of SHIV was 0.10 uM. The 101nt PRHIV had the same catalytic activity as the shorter RHIV (see FIG.35 where R designates RHIV and PR designates PRHIV), indicating that theextra sequence at the 3' and 5' ends (see FIGS. 34A-34B) did notinterfere with cleavage of the substrate.

Next, a mammalian expression vector was prepared by removing DNA codingfor RHIV and the autocatalytic cassette RNA from pHR with SmaI/SalI (seeFIGS. 34A-34B). The resulting fragment was cloned into the mammilianvector pMSG (see Example 27) cut with SmaI/SalI to give pMSGRHIV (seeFIG. 36). In pMSGRHIV, the catalytic RNA is driven by the MMTV promoterwhich is dexamethasone inducible.

Vector pMSGRHIV has been used to transfect human cells which were theninfected with HIV-1. Preliminary results show that the amount of tat andgag mRNAs produced in transfected cells was lower than the amountproduced in control cells.

Example 31

In this example, evidence is presented showing the in vivo downregulation of two genes in mammalian cells by using "hairpin" RNAcatalysts in trans. The two genes studied were hypoxanthine guanosinephosphoribosyl transferase (HGPRT) and chloramphenicol acetyltransferase (CAT).

A. In Vivo Inhibition of HGPRT

A 15 nt target sequence in the HGPRT gene of the hamster was identified.The sequence is:

    AUUCC*GUCAUGGCGA                                            SEQ ID 18!

A "hairpin" catalytic RNA was designed according to the principles setforth herein to cleave this target sequence. The catalytic RNA sequence,designated "RHGPT," is shown in FIG. 37. RHGPT base pairs with thetarget sequence in the two regions flanking the CGUC cleavage sequence,and the "hairpin" portion of RHGPT has the same sequence as the"hairpin" portion of the (-)sTRSV catalytic RNA sequence shown inFIG. 1. Additional vector bases coding for restriction sites are presentat the 3' and 5' ends of RHGPT.

RHGPT and the target sequence were synthesized as described in Example 2and tested in vitro as described in Example 3. The results of the testshowed that the target sequence was cleaved in vitro by the engineered"hairpin" ribozyme RHGPT at the * in the sequence given above (which isafter nt 87 in the sequence of the HGPRT gene).

Next, DNA coding for ribozyme RHGPT was cloned into the BamHI/MluI sitesof the mammalian expression vector pHC (prepared as described in Example23). The DNA coding for RHGPT and HC was excised from the resultingplasmid with SmaI/SalI and cloned into SmaI/SalI digested pMSG (Example27) to give plasmid pMRHPT, a map of which is shown in FIG. 37. The DNAcoding for RHGPT is located downstream of the dexamethasone-inducibleMMTV promoter on pMRHPT and upstream of the "hairpin" autocatalyticcassette (HC) which is included so that RHGPT will be properlyterminated at the 3' end after the RNA is transcribed.

Another plasmid, pMR2HPT, was also prepared. This plasmid is identicalto pMRHPT, except that a G35→C mutation in RHGPT was made (numberingaccording to FIG. 42D). Accordingly, the resulting ribozyme produced bythis plasmid would be inactive (see Example 22).

Chinese hamster ovary cells (CHO) were co-transfected with one of theseplasmids and plasmid pMSG-dhfr. The method of transfection and plasmidpMSG-dhfr are described in Example 27. Transfectants were selected andamplified with methotrexate, also as described in Example 27. Aschematic drawing of the selection scheme is presented in FIG. 38.

The poison 8-azaquanine, which is allowed into cells by the HGPRTenzyme, was then added to the cells (80 μg/ml) both in the presence andabsence of dexamethasone (1 μM) which induces the HGPRT ribozyme. In theabsence of dexamethasone, cells with or without DNA coding for RHGPTdied as expected. In the presence of dexamethasone, control cells died,but colonies of survivors were seen for cells transfected with pMRHPT.All cells transformed with pMR2HPT (coding for the inactive ribozyme)died.

To quantitate the rate of survival, CHO cells were grown in gpt medium(see Example 27), except untransfected, uninduced CHO cells which weregrown in MEM. The concentration of 8-azaquanine used was 80 μg/ml, andthe concentration of dexamethasone used was 1 μM. One hundred cells wereplated per dish. Colonies of cells were stained with crystal violet. Theresults are presented below:

    ______________________________________    Cells            Percent Survival    ______________________________________    Untransfected,   100%*    uninduced    Transfected with 33%    pMRHPT and induced    Transfected with  0%    pMRHPT but uninduced    Transfected with  0%    pMR2HPT and induced    ______________________________________     *Results were normalized with untransfected, uninduced cells set at 100%.     Actual survival rate for these cells was 80%.

The results are consistent with reduced levels of HGPRT due to thecleavage of HGPRT mRNA by the engineered "hairpin" ribozyme RHGPT afterinduction with dexamethasone. The in vivo activity of the ribozyme isnot likely due to antisense effects, since the disabled ribozyme codedfor by pMR2HPT was ineffective in increasing resistance to 8-azaquanine.Note that the mutation in pMR2HPT is not in the area of the ribozymethat base pairs to the substrate, so binding to the substrate shouldoccur.

Next, an S1 nuclease assay was performed to observe the in vivo levelsof MRNA coding for HGPRT. The S1 nuclease assay was performed asdescribed in Example 27. The 148 nt probe which was used hybridizes toHGPRT mRNA and was prepared by transcribing plasmid pHPTPr, as describedin Example 27 using P³² -labelled CTP. Plasmid pHPTPr contains theantisense HGPRT sequence from nt 20-160 (Konecki et al., Nucleic AcidsRes., 10, 6763-75 (1982)) bridging the cleavage sequence at nt 87 clonedinto pTZ18R (U.S. Biochemical) between the EcoRI and HindIII sites.

The results are shown in FIG. 39. In FIG. 39, lane 3 contains RNA fromcells transfected with pMRHPT and pMSG-dhfr but not induced, and lane 4contains RNA from cells transfected with pMRHPT and pMSG-dhfr which wereinduced with dexamethasone. A 30% reduction in the level of MRNA wasobserved when the "hairpin" ribozyme was induced (compare lanes 3 and4). This shows that the engineered "hairpin" ribozyme RHGPT reducesHGPRT activity by lowering the amount of MRNA, and the likely mechanismis cleavage of the HGPRT mRNA.

B. In Vivo Lowering of CAT mRNA Levels,

From a series of in vitro experiments using the techniques described inExamples 2 and 3, the optimum target sequence in the CAT gene wasdetermined to be:

    5'-UWUCA*GUCAGUWGCUCAA-3'                                   SEQ ID 19!

with cleavage at (*), which is nt 320 of the CAT gene. The "hairpin"catalytic RNA designed to cleave this target sequence, designated"RCAT," is shown in FIG. 40. RCAT was designed according to theprinciples set forth herein. It base pairs with the target sequence inthe two regions flanking the AGUC cleavage sequence, and the "hairpin"portion of RCAT has the same sequence as the "hairpin" portion of the(-)sTRSV catalytic RNA sequence shown in FIG. 1. Additional vector basescoding for restriction sites are present at the 3' and 5' ends of RCAT.

DNA coding for RCAT was ligated to pHC (Example 23) which had been cutwith BamHI/MluI. The resulting plasmid was cut with SmaI/SalI to removethe fragment coding for RCAT and HC. This fragment was ligated to pMSG(Example 27) which had been cut with SmaI/SalI to give pMSGRCAT. NextpCAT (Promega) was cut with PvuI/ PstI, and the fragment containing theCAT gene under the control of the SV40 promoter and enhancer wasisolated on a 1% low-melting agarose gel (Nusieve). Then the fragmentwas blunt ended with Klenow fragment and ligated into the EcoRI-cut,blunt-ended pMSGRCAT to give the final plasmid pMCATRCAT (see FIG. 40).This plasmid contains DNA coding for RCAT under control of thedexamethasone-inducible MMTV promoter and the CAT gene under control ofthe SV40 promoter. Further, DNA coding for the autocatalytic "hairpin"ribozyme is located downstream of DNA coding for RCAT so that RCAT willbe properly terminated at the 3' end upon transcription of the RNA.

Plasmid pMCATRCAT was cut with NdeI and then used to transfect CHO cellsalong with XhoI-cut pMSG-dhfr as described in Example 27. Transfectantswere selected and amplified and an S1 nuclease assay performed, all asdescribed in Example 27. The probe used for the S1 nuclease assay wasRNA transcribed from the plasmid pCATP. This plasmid contains thesequence of the CAT gene from nt 260-372 (GenBank sequence) bridging thecleavage sequence at nt 320 cloned into the EcoRI/HindIII site of pTZ18R(U.S. Biochemical) in the antisense direction. The probe was prepared asdescribed in Example 27 using ³² P-CTP and was a total of 119 nt long.

The results of the S1 nuclease assay are presented in FIG. 41. Areduction in CAT MRNA was observed in cells transfected with pMCATRCATand pMSG-dhfr and induced with dexamethasone as compared to uninducedcells. Thus, the engineered "hairpin" catalyst RCAT did reduce the levelof CAT mRNA. However, further attempts to locate the mRNA cleavageproducts have so far failed, and no lowering of CAT enzymatic activitywas seen.

Example 32

Additional mutagenesis experiments were performed changing bases in(-)sTRSV RNA and its substrate. All substrate RNAs and catalytic RNAswere prepared as described in Example 2. Mutagenesis was carried outsimply by making the required base change in the synthetic DNA template.All catalytic RNAs had additional vector bases GGG at the 5' end, andall substrate RNAs had additional vector bases GCG at the 5' end. Thesebases are required for efficient transcription (Milligan et al., NucleicAcids Res., 15, 8783-98 (1987)), and the C near the 5' end of allsubstrates ensured at least one P³² -labelled C in the 5' cleavagefragment. The reference sequences were the unmutated catalytic andsubstrate sequences (see FIG. 1).

Substrate and catalytic RNAs were assayed for catalytic activity asdescribed in Example 3. Generally, the final concentrations of substrateRNA was 0.01 uM, and the final concentration of catalytic RNA was 0.0.1uM. Assays were done at 37° C. for times ranging from 15-30 minutes, anda zero-time control was always included. Reaction products were analyzedon 15% acrylamide/7M urea gels, autoradiography performed, and the bandscut from the gels and counted. The control (unmutatedribozyme/substrate) was assayed at the same time as all mutant catalyticRNAs and substrates.

Nucleotide changes made in the native, unmutated sequence of both thecatalytic RNA and the substrate showed a range of catalytic effects. Thenucleotide changes shown in FIG. 42A had very little effect on thecatalytic activity of the ribozyme (75-100% of the activity of theunmutated sequence), while other nucleotide changes (FIG. 42B) had anintermediate effect on catalytic activity (5-75% of the activity of theunmutated sequence). Those mutational changes resulting in very low orno catalytic activity (less than 5% of the activity of the unmutatedsequence) are shown in FIG. 42C. The results further define thetwo-dimensional structure of the (-)sTRSV catalytic complex. A revised"hairpin" structure for the catalytic complex is shown in FIG. 42D. InFIGS. 42A-D, the ribozyme is numbered consecutively 1-50 nt and thesubstrate 1-16 nt (see FIG. 42D). Also, in FIGS. 42A-D, upper caseletters are used for the ribozyme nucleotides and lower case letters areused for the substrate nucleotides.

In summary, the results show that Helices 2 and 3 (see FIG. 42D) are notcontinuous but have an unpaired base between them. Helix 4 was found tobe shorter and Loops II and IV were found to be larger than previouslypredicted by computer modeling. Also, certain bases in Loops II and IVwere found to be invariant. Helix 4 can be extended towards the closedend (Loop III) of the "hairpin" to give increased stability to theribozyme, and the sequence of Loop III can be mutated with retention ofcatalytic activity. While conventional base pairing interactions betweenLoop I on the ribozyme and Loop V of the substrate were not observed, anA→C mutation in ribozyme Loop I partly restored activity to a previouslyinactive c→a mutation in substrate Loop V, indicating that some type ofinteraction between these two bases may be occurring. Finally, the datashow that the first base pair upstream of the N*GUC cleavage sequencecannot be an A:U or U:A, it must be G:C or C:G. It is believed that theG:C or C:G base pair is necessary for stability of thecatalyst-substrate complex and that A:U or U:A base pairs can be used ifsufficient stability is provided by other means such as possiblylengthening Helix 1. Indeed, it has been found that A:U and U:A basepairs can be used at these positions in the synthetic autocatalyticcatalyst of the invention. Accordingly, in engineering a "hairpin"catalyst based on (-)sTRSV, the substrate RNA preferably contains thetarget sequence 5'-SN*GUC-3', where S is G or C and cleavage occurs atthe *.

The results will now be discussed in detail. First, Table I lists allsubstrate sequences successfully cleaved by a catalytic RNA designedaccording to the "hairpin" model so that the bases flanking the N*GUCcleavage sequence in the substrate were base paired to the catalyst.Cleavage occurred at the *. The lower case letters in Table I designateadditional vector sequences.

                  TABLE 1    ______________________________________    Helix2      LoopV   Helix1    ______________________________________     1   gcg    UGAC    A*GUC CUGUUU       SEQ ID 10!     2   gcg    UGAC    A*GUC CUGUUUUUUU   SEQ ID 9!     3   gcg    UGAC    A*GUC CUGUUUUUUUCGC                                           SEQ ID 10!     4   gcg    UGUC    A*GUC CUGUUU       SEQ ID 11!     5   gcg    UGAG    A*GUC CUGUUU       SEQ ID 12!     6    .sup. AAAC    A*GUC CCCAAC       SEQ ID 13!     7    .sup. UUUC    A*GUC AGUUGC       SEQ ID 14!     8   gcg    UUUC    A*GUC AGUUGCUCAA   SEQ ID 15!     9   gcg    CCCC    U*GUC CCCGAG       SEQ ID 16!    10   gcg    UGGG    U*GUC GACAUAgC     SEQ ID 17!    11   gcg    UGAC    A*GUC GUGUUU       SEQ ID 18!    12   gcg    UGAC    A*GUC AUGUUU       SEQ ID 19!    13   gcg    AGAG    C*GUC GGUAUUAA     SEQ ID 20!    14   gcg    AGAG    C*GUC GGUAUUAAGCGG                                           SEQ ID 21!    15   gcg    AGAG    C*GUC GGUAUUAAGC   SEQ ID 22!    16   gcg    UUUC    U*GUC GUUUAACU     SEQ ID 23!    17   gcg    UGAC    U*GUC CUGUUU       SEQ ID 24!    18   gcg    UGAC    C*GUC CUGUUU       SEQ ID 25!    19   gcg    UGAC    G*GUC CUGUUU       SEQ ID 26!    20   gcg    UGCC    C*GUC UGUUGUGUGA   SEQ ID 27!    21   gcg    UGCC    C*GUC UGUUGUGU     SEQ ID 28!    22   gcg    CCAC    U*GUC GAUCGA       SEQ ID 29!    23   gcg    CCAC    U*GUC GAUCGAG      SEQ ID 30!    24   gcg    AUUC    C*GUC AUGGCGA      SEQ ID 31!    25   gcg    AUUC    C*GUC AUGGC        SEQ ID 32!    26   gcg    AUGC    G*GUC ACUCAUUA     SEQ ID 33!    27   gcg    AUGC    G*GUC ACUCAU       SEQ ID 34!    28   gcg    AUCC    U*GUC CAUUCAA      SEQ ID 35!    29   gcg    AUCC    U*GUC CAUUCAAG     SEQ ID 36!    30   gcg    UUGG    U*GUC GACCUGAA     SEQ ID 37!    31   gcg    ACAG    C*GUC UGCUCC       SEQ ID 38!    32   gcg    UUGC    G*GUC GCUACG       SEQ ID 39!    33   gcg    UUGC    G*GUC GCUACGUC     SEQ ID 40!    34   gcg    UCUC    A*GUC ACUAUG       SEQ ID 41!    35   gcg    CACC    U*GUC AACAUAA      SEQ ID 42!    36   gcg    CACC    U*GUC AACAUAAUU    SEQ ID 43!    37    gc    GUGG    U*GUC UGUGGA       SEQ ID 44!    ______________________________________

As shown in Table I, every base pair in Helices 1 and 2 can be changedto any other base pair, and the substrate will be cleaved by theribozyme, except the base pair in Helix 2 adjacent to the N*GUC cleavagesequence (designated by S in FIG. 42D). When base G11 in the ribozymewas changed to C, catalytic activity was lost (see FIG. 42C). When asecond mutation (c4→g) was made in the substrate so that base pairingwas restored, catalytic activity was also restored. However, whencatalysts and substrates having A:U base pairs at these positions weretested, no catalytic activity was observed. A:U base pairs in thisposition were checked for catalytic activity with a variety ofsubstrates and corresponding ribozymes, and all of them were found to beinactive. Accordingly, this base pair must be G:C or C:G and cannot beA:U or U:A unless, as discussed above, other measures are taken tostabilize the substrate-catalyst complex.

As already demonstrated in Examples 18 and 21, it is possible to adjustthe length of Helix 1 to optimize the rate of cleavage. In particular,when the native sequence was extended four base pairs by adding four A:Ubase pairs to the open end of Helix 1, an increased rate of activity wasseen (Example 21). However, when three additional G:C base pairs wereadded in the present experiments, a large loss of activity occurred.This phenomenon was observed for numerous substrates and ribozymes.Accordingly, Helix 1 has an optimal length for each substrate used. Thisoptimal length can be determined by varying the length of Helix 1 tofind the length giving the maximum reaction rate. Generally, thisoptimal length will be from 6 to 12 base pairs. It is believed that whenHelix 1 is shorter than the optimal length, the ribozyme does not bendwell to the substrate, and a reduced cleavage rate is obtained. WhenHelix 1 is longer than the optimal length, the ribozyme remains bound tothe cleaved fragments of the substrate longer due to the increased basepairing so that a particular ribozyme molecule will cleave lesssubstrate in a given time period.

A type of "hinge" region, consisting of a single A base at position 15,is present between Helices 2 and 3 of the catalytic RNA (see FIG. 42D).When the A15:U49 potential base pair was changed to the compensatorybase pair U15:A49 by a double mutation, activity remained at nearly 100%(FIG. 42A). The single U49→A49 mutation, which would lead to an A:Amismatch, also had no effect on activity (FIG. 42A), showing that nobase pair was needed at this position. Accordingly, the results showthat a base pair does not exist between bases A15 and U49 of theribozyme.

Proof of the base pair C17:G47 was obtained previously (see Example 22),but the presumptive base pair next to it, C16:G48, could not be shown toexist. Both the catalyst containing the C:C mismatch and the catalystcontaining the reverse G:C base pair were inactive (FIG. 42C). Sincethis G and C are opposite each other in this position and are adjacent abase pair, it is highly likely they are actually base paired as well.The fact both the mismatch and reverse base pair were inactive suggeststhat the identity of the bases must be maintained in this position.Thus, it is likely that this base pair exists and must be C16:G48 asshown.

At the end of Helix 3 is the predicted base pair G19:C45. The catalystcontaining the C:C mismatch was inactive (FIG. 42C), and the catalystcontaining the reverse base pair C19:G45 was partially active (FIG.42B), showing that a base pair likely exists in this position.

Base pairing between A18:U46 is shown in FIG. 42D even thoughmutagenesis was not done. The existence of a base pair at this positionis likely since it would be located between two proven G:C base pairs.

Helix 4 is a shorter helix than predicted by straightforward computermodeling and two-dimensional energy minimization (compare FIGS. 1 and 19with FIG. 42D). This helix was previously shown to exist by showing thatthe base pair C27:G35 actually existed (see Example 22). The next basepair downstream, A28:U34, also exists. A catalyst containing thecompensatory double mutation to form a U28:A34 base pair was 100% active(FIG. 42A) and was 10 times more active than a catalyst containing thesingle mismatch mutation, U34→A (FIG. 42B).

The following results show that an active ribozyme is produced whenHelix 4 is extended and the sequence of Loop III is changed. As shown inFIG. 42A, Loop III was replaced with the common and very stable RNAhairpin sequence 5'-GGAC(UUCG)GUCC-3' SEQ ID 1! characterized by Tinocoand colleagues (Cheong et al., Nature, 346, 680-82 (1990)); Varani etal., Biochem., 30, 3280-89 (1991). As a result of this substitution,Helix 4 was extended by four base pairs and the GUU sequence of Loop IIIwas replaced with the sequence UUCG (see FIG. 42A). The resulting RNAcatalyst was active. In fact, the activity of this ribozyme was greaterthan that of the unmutated form. Further, the mutant ribozyme was morethermal stable. It remained active at 45° C., while the unmutated RNAcatalyst loses most of its activity at this temperature (see FIG. 43).

It was concluded from this experiment that Loop III does not have aconserved or invariant base sequence and that Helix 4 can be extendedtowards loop III by at least four base pairs without loss of activity.The four additional base pairs in Helix 4 should provide helixstabilization of this region. The secondary folding energy of Helix 4and Loop III in the native structure is +0.6 Kcal/mole, while that ofthe catalyst having the extended Helix 4 and the Loop III of thesequence UUCG was determined to be -11.1 Kcal/mole (methods described inExample 6). Thus, the presence of the Tinoco et al. hairpin sequenceincreases the folding energy by 11.7 Kcal/mole.

However, the simple replacement of Loop III with the sequence UUCG (seeFIG. 42C) gives an inactive ribozyme. This is believed to occur becausethe bases of Loop III help stabilize Helix 4.

When Loop III of the native ribozyme is cut between U31 and U32,activity is lost (see FIG. 42C). A likely explanation for this is thatwhen the loop is cut, Helix 4 opens up and catalytic activity is,consequently, lost. The "cut" ribozyme was prepared by synthesizing theribozyme in two parts and allowing the parts to anneal.

With the extension of Helix 4 by the Tinoco et al. hairpin sequence, thepotential base pair C29:G33 is between the two proven bases of Helix 4and the four base pairs of the Tinoco sequence. Accordingly, it seemedlikely that this base pair existed. However, both the single mutationG33→C and the double mutation C29:G33→G29:C33 were partially active (seeFIG. 42B) and had about the same levels of activity.

The existence of other base pairs in Helix 4 could not be shown. Thenext potential base pair upstream in Helix 4 was C25:G36. The doublemutation, G25:C36, was also inactive (FIG. 42C), indicating that a basepair does not actually exist. Although the base pair could exist withthe identity of the bases being required, this appears unlikely sincethe G and C are not directly opposite each other since there is A (A26)between the C and the next proven base pair. By the same sort ofanalysis, the next potential base pair upstream, A24:U37, was shown notto exist. When the mutation A24:U37→G24:C37 was made, no activity wasseen (FIG. 42C), indicating that a base pair does not exist at thisposition.

Further mutation studies showed that Loops II and IV are larger thanoriginally predicted by computer modeling and energy minimization(compare FIGS. 1 and 42D). Also, some of the bases in these loops arerequired for activity.

As already shown in Example 12, when bases A22, A23 and A24 in Loop IIwere mutated to GUC, a totally inactive ribozyme was obtained,indicating that one or all of these bases are essential. The singlemutation in Loop II of C25→G was inactive (see FIG. 42C). The doublemutation C25→G +G36→C was also inactive (see FIG. 42C), indicating thata base pair did not exist. When A26 was deleted, activity was lost,indicating that a base was needed in this position for activity.

Among two active mutations in this Loop II was the G21→C mutation whichgave significant, although somewhat reduced, activity. This mutation hadadditional significance since it converted the AGAA sequence in Loop 2to ACAA, and activity was largely retained. Loop 1 also has an AGAAsequence, but when this AGAA sequence was mutated to ACAA, activity wastotally lost. Thus, while these two loops have identical AGAA sequences,they are likely involved in very different functions in the ribozyme.

The other active mutation involved both Loops 1 and 2. This mutation wasthe double mutation of A20→C and A7→C which gave full activity.

In Loop IV, the single mutations U39→G and U39→C were active (see FIG.42A), indicating that this base was not essential for catalytic activityand was not involved in a base pair. The mutation U41→C gave 25%activity, indicating that this base is probably involved in some way inribozyme functioning. All other mutations in Loop IV were inactive.Inactive single mutations A38→G, A38→U, A40→G, A40→U, U42→C, delete U42,and A43→U were all inactive, as was the double mutation A43→U and U37→A.This result was significant because the possibility existed for a stemto occur in Loop IV: ##STR6## If this stem existed, the A43:U37alternate base pair of U43:A37 should have been active. Since it wasnot, it was concluded that a stem in Loop IV does not exist.

In Loop V, the loop formed by the cleavage sequence in the substratemolecule, the bases guc are not base paired to the catalyst. Also,substrates having the sequence GUC are 6-100 times more susceptible tocleavage than substrates in which one of these bases has been changed(see Example 25 and FIG. 42C).

All of the bases of Loop I in the ribozyme, the loop opposite Loop V,have been mutated. When base A7 was changed to a G or C and when baseA10 was changed to a G, the resulting catalysts were active (FIGS. 42Aand 42B). However, when base G8 was changed to a U or C or when base A9was changed to U, inactive ribozymes were produced (see FIG. 42C).

Potential base pairs between Loop V in the substrate and Loop I in theribozyme were tested. In particular, experiments to determine whether G8and A9 of Loop I were base paired with the corresponding bases insubstrate Loop V were performed. Mutations tested were G8→C:c8→g,G8→U:u7→g, and A9→U:u7→a. These mutations were inactive (see FIG. 42C),showing that, for the bases checked, no base pairs existed.

While the c9→a mutation in substrate Loop V was inactive (FIG. 42C) andthe A7→C mutation of ribozyme Loop I was fully active (FIG. 42A), thedouble mutation of c9→a and A7→C showed partial activity (see FIG. 42B).This suggests that some type of interaction may be occurring betweenbase c9 of the substrate and base A7 of the ribozyme which are oppositeeach other in the loops. The nature of this interaction is not know,except that it is not conventional base pairing.

A triple mutation comprising this same double mutation plus changingbase A20→C in the ribozyme showed the same level of activity as thedouble mutation (FIG. 42B). The triple mutation represents base changesin the ArMV sequence proposed to be catalytic (Gerlach and Haseloff,Gene, 82, 43-52 (1989)) and would suggest that a triple base interactionmight occur between the three bases. However, the catalyst containingthe triple mutation gave catalytic activity at the same level as thedouble mutation, arguing against a triple base interaction. The resultsobtained with the catalyst containing the double mutation, however,indicate that an interaction of some type takes place between substrateC9 and ribozyme A7.

Example 33

This example illustrates the use of an HIV-1-specific "hairpin" ribozymeto inhibit HIV-1 gene expression in human cells. A target sequencewithin the 5' leader sequence of the HIV-1 mRNA was chosen based on thehigh degree of conservation of the sequence among most known HIV-1isolates (see Example 29) and its presence in both early and late viralgene products. By cleaving at this site, the ribozyme prevents the HIV-1mRNA from being capped, and the uncapped mRNA is presumably exposed todegradation. In addition, uncapped mRNAs are poorly translated.

To separate antisense from cleavage effects, a disabled "hairpin"ribozyme was used as a control. The disabled ribozyme binds to the samesequence in the HIV-1 mRNA as the active ribozyme, but the disabledribozyme has no catalytic activity. Thus, any inhibition of HIV-1expression observed with the disabled ribozyme would be due to antisenseeffects.

The results of the experiments showed that about 70-80% inhibition ofHIV-1 expression was obtained when active "hairpin" ribozyme was used,whereas only about 10% inhibition was obtained when the disabledribozyme was used. Thus, most of the activity of the active "hairpin"ribozyme is attributable to its catalytic mode of action, not to itsantisense features.

A. Construction Of A Vector Coding For An HIV-1-Specific "Hairpin"Ribozyme

Plasmid pHR was used as a starting material to prepare a vector codingfor an HIV-1-specific "hairpin" ribozyme. The preparation of plasmid pHRis described in Examples 29 and 30. It contains a "hairpin" ribozymeupstream of the autocatalytic termination cassette. The "hairpin"ribozyme in pHR is designed to cleave a sequence in the 5' leadersequence of HIV-1 MRNA (See Example 29). The sequence is highlyconserved in HIV-1 isolates (see Example 29) and is present in late andearly viral gene products.

The HIV-1-specific ribozyme and the autocatalytic cassette were removedfrom pHR by cleaving pHR with XhoI and HindIII. The resulting fragmentcontaining the ribozyme and autocatalytic cassette was cloned intoplasmid pHBApr-1 which had been cleaved with SalI and HindIII to produceplasmid pB-HR. Plasmid pB-HR is a mammalian expression vector which hasthe HIV-1-specific "hairpin" ribozyme under the control of the veryactive beta-actin promoter. The preparation of plasmid pHBApr-1 isdescribed in Gunning et al., Proc. Natl. Acad. Sci. USA, 84, 4831-4835(1987).

B. Construction Of A Vector Coding For A Disabled "Hairpin" Ribozyme

An inactive (disabled) ribozyme was used as a control to distinguish anyantisense activity of the active "hairpin" ribozyme from its catalyticactivity. The disabled control ribozyme (designated dRHIV) has the samesequence as the active "hairpin" ribozyme, except that the AAA sequencelocated in loop 2 between helices 3 and 4 has been changed to CGU (thesequences of the active and disabled ribozymes are given in FIG. 44).

Disabled ribozyme dRHIV was prepared as described in Example 2, andassayed for catalytic activity as described in Examples 3 and 29 againstthe HIV-1 target sequence described in Example 29. It was found to haveno catalytic activity.

Comparative binding studies were carried out on disabled ribozyme dRHIVand active ribozyme RHIV (prepared as described in Example 29).Unlabelled disabled or active ribozyme (0.05 uM) was mixed with P³² -CTPlabelled substrate SHIV (preparation described in Example 29 (0.01 uM)to give an excess of substrate (1:2 ribozyme: substrate ratio) in thepresence of 40 mM Tris, pH 7.5, and 2 mM spermidine (note no Mg++ waspresent because only binding is desired, not cleavage). This was heatedat 90° C. for 2 minutes, incubated on ice for 2 minutes, incubated at37° C. for 10 minutes, and incubated on ice for 2 hours. Finally, theresultant material was run on a 15% acrylamide gel in 40 mM Trisacetate, pH 7.5, and 12 mM Mg acetate at 4° C. This is a non-denaturinggel which will not disrupt complexes between the ribozymes andsubstrate.

The results are shown in FIG. 45. In FIG. 45, Lane 1 contains substratealone with no added ribozyme. Lane 2 contains a P³² CTP-labelled activeribozyme used as a mobility reference. Lane 3 contains unlabelledHIV-1-specific active ribozyme and a P³² -CTP-labelled substrate (9%bound). Lane 4 contains disabled ribozyme and a P³² -CTP-labelledsubstrate (18% bound). Arrows indicate the mobility of (S) substrate,(R) ribozyme and (RS) the ribozyme/substrate complex which forms whenactive or disabled ribozyme binds to substrate.

From FIG. 45, it can be seen that binding to substrate occurred for boththe disabled and active ribozymes. Further, it can be seen from theintensity of the R and RS bands that approximately the same amount ofbinding to substrate occurred for both the disabled and activeribozymes. It can be concluded, therefore, that the change from AAA inthe active ribozyme to CGU in the disabled ribozyme did not decreasebinding of the disabled ribozyme to substrate.

The combined results show that dRHIV is catalytically inactive, but thatits binding to substrate is unaltered as compared to the activeribozyme. Disabled ribozyme dRHIV is, thus, a good control fordistinguishing the inhibition of HIV-1 expression by the active ribozymewhich is due to catalytic action from that due to antisense action.

Plasmid pdHR containing disabled ribozyme dRHIV was prepared asdescribed in Examples 29 and 30 for plasmid pHR. Thus, plasmid pdHRcontains the disabled "hairpin" ribozyme upstream of the autocatalytictermination cassette. The disabled ribozyme and the cassette wereremoved from pdHR by cleaving pdHR with XhoI and HindIII and cloned intoplasmid pHBApr-1 which had been cleaved with SalI and HindIII to produceplasmid pB-dHR. Plasmid pB-dHR is a mammalian expression vector whichhas the disabled "hairpin" ribozyme under the control of the very activebeta-actin promoter.

C. Inhibition Of HIV-1 Expression In Human Cells By "Hairpin" Ribozyme

1. Transfection Procedure

HeLa cells Gey et al., Cancer Research, 12, 264-65 (1952)! werepropagated in DMEM medium (GIBCO) containing 10% fetal bovine serum(FBS), 100 μg/ml penicillin-streptomycin, 2 mM L-glutamine and 1 mMsodium pyruvate. Cells were plated at approximately 70% confluency in a12-well plate (1×10⁵ cells/well) one day prior to transfection. Beforetransfection, the medium was replaced with 2 ml of fresh DMEM containing10% FBS, 100 μg/ml penicillin-streptomycin, 2 mM L-glutamine and 1 mMsodium pyruvate. Calcium-phosphate-precipitated plasmid DNA (20 μg) wasadded. DNA concentration in each reaction was normalized by adding pUC19DNA Yanisch-Perron et al., Gene, 33, 103 (1985)!. After 24 hours, themedium was removed and the cells washed three times with 1× PBS(composition given in J. Exp. Med., 98, 167 (1954)). The cultures werethen maintained in 2 ml of DMEM containing 10% FBS, 100 μg/mlpenicillin-streptomycin, 2 mM L-glutamine and 1 mM sodium pyruvate for48 hours, after which time the cells were harvested.

2. Expression Of The Active Ribozyme

Ribozyme expression in the HeLa cells was confirmed by dot blot analysiswith total cellular RNA from pB-HR-transfected HeLa cells. Total RNAfrom untransfected or pB-HR-transfected cells was isolated using a rapidprocedure. First, the cells were washed twice in ice-cold 1× PBS withoutcalcium and magnesium and then lysed by adding 10 mM EDTA (pH 8.0) and0.5% SDS. Then, 0.1M sodium acetate (pH 5.2) and 10 mM EDTA were addedto the lysed cells. Total RNA was recovered from the cell lysates by asingle extraction with water-saturated phenol followed by multipleethanol precipitations. To remove the template DNA, the isolated RNA wassubjected to DNAse I treatment performed as follows. The pellet wasresuspended in 200 ul of TE (10 mM Tris, pH 7.5, 1 mM EDTA) and treatedwith RNAse-free pancreatic DNAse-I (Boehringer Mannheim) at a finalconcentration of 2 ug/ml in 10 mM MgCl₂, 0.1 mM dithiothreitol and 10 mMRNAse inhibitor (Promega) for 60 minutes at 37° C. The reaction wasstopped by adding 10 mM EDTA and 0.2% SDS, and the RNA was extracted byphenol:chloroform treatment followed by ethanol precipitation. Therecovered total RNA was redissolved in DEPC-treated water (treatmentdescribed in Sambrook et al., Molecular Cloning (1985)), and 20 μg wereimmobilized onto a GeneScreen Plus membrane (Dupont) by gentle suctionwith blotting manifold (BRL). The membrane was then probed with a 5'g-³² P-radiolabeled synthetic oligodeoxy-ribonucleotide complementary tothe ribozyme RNA (50 nt).

The results are presented in FIG. 46. It is clear from FIG. 46 that theHeLa cells do in fact express the HIV-1-specific ribozyme since ribozymeRNA was present in the total RNA extracted from pB-HR-transfected HeLacells, but not in the total RNA extracted from untransfected cells.HIV-1-specific ribozyme RNA was also not observed in total RNA extractedfrom HeLa cells transfected with pHbAPr-1 (data not shown).

3. Inhibition Of Expression Of HIV-1 By "Hairpin" Ribozyme

HeLa cells were transfected with three plasmids: 1) pHXB2gpt; 2)pC15-CAT; and 3) pHBApr-1 (control) or a ribozyme-containing plasmid(pB-HR or pB-dHR). Plasmid pHXB2gpt is an HIV-1 plasmid. It istranslated into viral proteins, including tat and gag, and codes for thetarget sequence with which the active and disabled ribozymes aredesigned to bind. The preparation and properties of pHXB2gpt aredescribed in Ratner et al., AIDS Res. Hum. Retrovir., 3, 57-69 (1987).

In plasmid pC15-CAT the chloramphenicol acetyl transferase (CAT) gene islinked to the LTR promoter; this promoter is activated by the tatprotein. See Arya et al., Science, 229, 69-73 (1985). Plasmid pC15-CATis available from Biotech Research Laboratories, Inc., Rockville, Md. Amap of it is provided in Arya et al., supra.

In the design of the assay, if the ribozyme produced by pB-HR iscatalytic, then it will cleave the HIV-1 mRNA causing reduced tat andgag expression. Reduced tat production would result in reducedactivation of the LTR promoter and would be reflected in reducedproduction of CAT enzyme. Reduction in the amount of gag would bereflected in a reduced level of p24 antigen.

To assay for CAT activity, extract were prepared by freeze-thawing theharvested transfected cells. The cell lysates were assayed for CATenzyme activity as described in Arya et al., Science, 229, 69-73 (1985).For quantitation, the unacetylated and acetylated forms of ¹⁴ C!chloramphenicol were cut out of the thin layer chromatogram, and countedin a scintillation counter. The results are presented in FIG. 47A. Thevalues are expressed as percent of control value (the control consistedof HeLa cells transfected with pHXB2gpt, pC15CAT and pHBApr-1 (lackingribozyme sequences)). Transfection experiments were performed three ormore times, and the data are presented as the mean +/- SD of the mean.

To assay for p24 antigen, the Coulter HIV-1 p24 ELISA kit was usedaccording to the manufacturer's instructions to quantitate p24 viralcore antigen. Absorbance was read at 450 nm using an ELISA plate reader.Viral protein concentration in the culture supernatant was determinedfrom the absorbance using a standard curve. The values were thenexpressed as percentage of control (the control consisted of HeLa cellstransfected with pHXB2gpt, pC15CAT and pHBApr-1 (lacking ribozymesequences)). The inhibition of expression of p24 was used to determinethe effectiveness of the ribozyme as an inhibitor of HIV-1 replicationand expression. The results are shown in FIG. 47B.

In FIGS. 47A and 47B, A is the control, i.e., HeLa cells transfectedwith pHXB2gpt, pC15CAT and pHBApr-1 (lacking ribozyme sequences). B isHeLa cells transfected with pHXB2gpt, pC15CAT and pB-HR (1:5 ratio ofpHXB2gpt:pB-HR). C is HeLa cells transfected with pHXB2gpt, pC15CAT andpB-HR (1:10 ratio of pHXB2gpt:pB HR). D is HeLa cells transfected withpHXB2gpt, pC15CAT and pB-dHR (1:10 ratio of pHXB2gpt:pB-d-HR).

As can be seen from FIGS. 47A and 47B, the expression of activeHIV-1-specific ribozyme in this transient assay inhibited HIV-1expression and virus production significantly (about 70-80 %). The ratiogiven above (1:10 pHXB2gpt:pB-HR) was the optimum.

As noted earlier, the disabled ribozyme was used to distinguish betweenantisense and catalytic modes of action for the ribozyme. As shown inFIGS. 47A and 47B, the disabled ribozyme gave only about 10% inhibitionas compared to the 70 to 80% inhibition by the active ribozyme. Theseresults show that the inhibition of HIV-1 replication and expressionobserved is mainly attributable to the catalytic property of theribozyme and not to its antisense features.

4. Ribozyme Specificity

Plasmid pTAT was transfected into HeLa cells along with pC15-CAT andpB-HR as described above, and CAT activity and p24 antigen determined asdescribed above. The preparation of pTAT is described in Knight et al.,Science, 236 , 837-840 (1987). This plasmid produces tat, but lacks thetarget sequence for the HIV-1-specific ribozyme. By substituting pTATfor pHXB2gpt, the ribozyme target was eliminated but tat protein wasproduced which could transactivate the LTR promoter.

The results are shown in FIG. 48. In FIG. 48, A is HeLa cellstransfected with pTAT and pC15CAT. B is HeLa cells transfected withpTAT, pC15CAT and pB-HR (at a ratio of 1:10 pTAT:pB-HR).

As can be seen in FIG. 48, no inhibition of CAT gene expression wasobserved when PTAT was used instead of pHXB2gpt. These resultsdemonstrate that the HIV-1-specific ribozyme inhibits gene expressiononly in the context of its appropriate target sequence.

Example 34

In this example, the use of the "hairpin" ribozyme RHIV (described inExample 29) to specifically lower levels of expression of HIV-1sequences in human cells (HeLaT4+) is illustrated. The plasmid used todeliver the ribozyme was pdRHIV (shown in FIG. 49). This plasmid wasconstructed by inserting the SmaI/SalI fragment from plasmid pHR (seeFIG. 34; preparation of pHR described in Examples 29 and 30), whichcontains the HIV-1-specific "hairpin" ribozyme upstream of the hairpinautocatalytic cassette, into the SmaI/XhoI sites of plasmid pMSG-dhfr(described in Example 27) to give the plasmid pdRHIV. In pdRHIV, theribozyme and autocatalytic cassette are under the control of thedexamethasone-inducible MMTV LTR promoter (see FIG. 49).

The cells were also transfected with plasmid pCDLTR which provides thetarget for the ribozyme. Plasmid pCDLTR is illustrated in FIG. 50. Toprepare pCDLTR, additional nucleotides were synthesized as described inExample 2 and ligated into the HindIII site of pC15-CAT to extend theLTR sequence to +132, which gave the plasmid pCDLTR (see FIG. 50). Thisplasmid, thus, contains the first 132 nt of the HIV-1 LTR followed bythe CAT gene. The LTR sequence contains the target sequence for theHIV-1-specific ribozyme (from nt 107-122 in the HIV-1 LTR). The LTRsequence also contains the TAR sequence which is recognized by the tatprotein and activated to give expression of the CAT gene.

Three plasmids, pdRHIV, pCDLTR and the plasmid pTAT (see Example 33)were stably transfected into HeLaT4+ cells Maddon et al., Cell, 47, 333(1986)! using the procedure described in Example 27. This created a cellin which the tat protein should be expressed and cause expression of theCAT gene by transactivation of the HIV-1 LTR promoter in pCDLTR. Whenribozyme production is induced by dexamethasone, the CAT transcript(which contains the HIV-1 LTR sequence) should be cleaved, removing the5' cap, with the result that CAT enzyme activity is reduced (see Example33). A flowchart depicting this scheme is shown in FIG. 51.

Transfected cells (3×10⁵) were induced by incubation with 1 uMdexamethasone for 72 hours. The induced transfected cells were shown toproduce the HIV-1 ribozyme by a S1 nuclease protection assay (performedas described in Example 27) (data not shown).

CAT activity was measured as follows. Induced and uninduced cells wereharvested and then resuspended in 0.2 ml of 0.25M Tris, pH 8.0. Theresuspended cells were homogenized with a Potter-Elvejhem homogenizerand the homogenate was centrifuged at 1200 rpm for 10 minutes at 4° C.The supernatant was heated at 65° C. for 2 minutes. Then, 2 microlitersof ¹⁴ C chloramphenicol and 20 microliters of 4 mM N-butyl-CoA wereadded, and the mixture was incubated at 37° C. for 2.5 hours. Then 0.3ml of xylene was added to dissolve the products which retain the butyrylgroup and then removed with a pipette before counting in a scintillationcounter.

Uninduced cells produced tat protein as determined by the ability of thecells to express high levels of CAT activity (see Table 2 below). Uponinduction of the ribozyme with dexamethasone, CAT enzyme levels werereduced by 42% (see Table 2), indicating that the ribozyme was effectiveagainst the HIV-1 target sequence in human cells. Although a disabledribozyme was not used in these experiments as a control to assay forantisense activity, the results of the experiment described in Example33 indicate that antisense effects can be expected to be small.

                  TABLE 2    ______________________________________             EXPERIMENT (cpm)             1     2         3       4    ______________________________________    Uninduced  126,819 33,993    67,491                                       62,633    Induced    42,199  23,443    49,841                                       35,950    % Reduction               70%     31%       26%   43%    ______________________________________     Average % reduction = 42.5%

As various changes could be made in the above described products andmethods without departing from the scope of the invention, it isintended that all matter contained in the above description or shown inaccompanying drawings shall be interpreted as illustrative and shall notbe interpreted in a limiting sense.

    __________________________________________________________________________    SEQUENCE LISTING    (1) GENERAL INFORMATION:    (iii) NUMBER OF SEQUENCES: 90    (2) INFORMATION FOR SEQ ID NO:1:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 12 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:    GGACUUCGGUCC12    (2) INFORMATION FOR SEQ ID NO:2:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 10 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:    GACAGUCCUG10    (2) INFORMATION FOR SEQ ID NO:3:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 15 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:    GUGACAGUCCUGUUU15    (2) INFORMATION FOR SEQ ID NO:4:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 13 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:    AGAAACACACGUU13    (2) INFORMATION FOR SEQ ID NO:5:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 36 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:    ACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA36    (2) INFORMATION FOR SEQ ID NO:6:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:    CACGGACUUCGGUCCGUG18    (2) INFORMATION FOR SEQ ID NO:7:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 45 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:    ACCAGAGAAACACACGGACUUCGGUCCGUGGUAUAUUACCUGGUA45    (2) INFORMATION FOR SEQ ID NO:8:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 68 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:    TACCAGGTAATATACCACAACGTGTGTTTCTCTGGTTGACTTCTCTGTTTCTATAGTGAG60    TCGTATTA68    (2) INFORMATION FOR SEQ ID NO:9:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 34 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:    AAACAGGACTGTCACGCTATAGTGAGTCGTATTA34    (2) INFORMATION FOR SEQ ID NO:10:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:    GCGUGACAGUCCUGUUU17    (2) INFORMATION FOR SEQ ID NO:11:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 15 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:    GAAACAGUCCCCAAC15    (2) INFORMATION FOR SEQ ID NO:12:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 50 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:    GUUGGGAGAAGUUUACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA50    (2) INFORMATION FOR SEQ ID NO:13:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 15 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:    GUUUCAGUCAGUUGC15    (2) INFORMATION FOR SEQ ID NO:14:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:    GUUUCAGUCAGUUGCUC17    (2) INFORMATION FOR SEQ ID NO:15:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 21 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:    GGGUUUCAGUCAGUUGCUCAA21    (2) INFORMATION FOR SEQ ID NO:16:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 29 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:    GGAATTCACCCGTCAGTTTTTAATACTGC29    (2) INFORMATION FOR SEQ ID NO:17:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 28 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:    TGGATCCATTCTAGTATTTTGAGCTTCT28    (2) INFORMATION FOR SEQ ID NO:18:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 15 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:    AUUCCGUCAUGGCGA15    (2) INFORMATION FOR SEQ ID NO:19:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:    UUUCAGUCAGUUGCUCAA18    (2) INFORMATION FOR SEQ ID NO:20:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 21 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:    GCGUGACAGUCCUGUUUUUUU21    (2) INFORMATION FOR SEQ ID NO:21:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 24 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:    GCGUGACAGUCCUGUUUUUUUCGC24    (2) INFORMATION FOR SEQ ID NO:22:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:    GCGUGUCAGUCCUGUUU17    (2) INFORMATION FOR SEQ ID NO:23:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:    GCGUGAGAGUCCUGUUU17    (2) INFORMATION FOR SEQ ID NO:24:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 15 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:    GAAACAGUCCCCAAC15    (2) INFORMATION FOR SEQ ID NO:25:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 15 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:    GUUUCAGUCAGUUGC15    (2) INFORMATION FOR SEQ ID NO:26:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 21 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:    GCGUUUCAGUCAGUUGCUCAA21    (2) INFORMATION FOR SEQ ID NO:27:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:    GCGCCCCUGUCCCCGAG17    (2) INFORMATION FOR SEQ ID NO:28:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:    GCGUGGGUGUCGACAUAGC19    (2) INFORMATION FOR SEQ ID NO:29:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:    GCGUGACAGUCGUGUUU17    (2) INFORMATION FOR SEQ ID NO:30:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:    GCGUGACAGUCAUGUUU17    (2) INFORMATION FOR SEQ ID NO:31:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:    GCGAGAGCGUCGGUAUUAA19    (2) INFORMATION FOR SEQ ID NO:32:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 23 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:    GCGAGAGCGUCGGUAUUAAGCGG23    (2) INFORMATION FOR SEQ ID NO:33:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 21 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:    GCGAGAGCGUCGGUAUUAAGC21    (2) INFORMATION FOR SEQ ID NO:34:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:    GCGUUUCUGUCGUUUAACU19    (2) INFORMATION FOR SEQ ID NO:35:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:    GCGUGACUGUCCUGUUU17    (2) INFORMATION FOR SEQ ID NO:36:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:    GCGUGACCGUCCUGUUU17    (2) INFORMATION FOR SEQ ID NO:37:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:    GCGUGACGGUCCUGUUU17    (2) INFORMATION FOR SEQ ID NO:38:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 21 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:    GCGUGCCCGUCUGUUGUGUGA21    (2) INFORMATION FOR SEQ ID NO:39:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:    GCGUGCCCGUCUGUUGUGU19    (2) INFORMATION FOR SEQ ID NO:40:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:    GCGCCACUGUCGAUCGA17    (2) INFORMATION FOR SEQ ID NO:41:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:    GCGCCACUGUCGAUCGAG18    (2) INFORMATION FOR SEQ ID NO:42:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:    GCGAUUCCGUCAUGGCGA18    (2) INFORMATION FOR SEQ ID NO:43:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 16 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:    GCGAUUCCGUCAUGGC16    (2) INFORMATION FOR SEQ ID NO:44:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:    GCGAUGCGGUCACUCAUUA19    (2) INFORMATION FOR SEQ ID NO:45:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:    GCGAUGCGGUCACUCAU17    (2) INFORMATION FOR SEQ ID NO:46:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:    GCGAUCCUGUCCAUUCAA18    (2) INFORMATION FOR SEQ ID NO:47:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:    GCGAUCCUGUCCAUUCAAG19    (2) INFORMATION FOR SEQ ID NO:48:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:    GCGUUGGUGUCGACCUGAA19    (2) INFORMATION FOR SEQ ID NO:49:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:    GCGACAGCGUCUGCUCC17    (2) INFORMATION FOR SEQ ID NO:50:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:    GCGUUGCGGUCGCUACG17    (2) INFORMATION FOR SEQ ID NO:51:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:    GCGUUGCGGUCGCUACGUC19    (2) INFORMATION FOR SEQ ID NO:52:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:    GCGUCUCAGUCACUAUG17    (2) INFORMATION FOR SEQ ID NO:53:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 17 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:    GCGCACCUGUCACAUAA17    (2) INFORMATION FOR SEQ ID NO:54:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:    GCGCACCUGUCACAUAAUU19    (2) INFORMATION FOR SEQ ID NO:55:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 16 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:    GCGUGGUGUCUGUGGA16    (2) INFORMATION FOR SEQ ID NO:56:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 16 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:    UGCCCGUCUGUUGUGU16    (2) INFORMATION FOR SEQ ID NO:57:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 50 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:    AAACAGAGAAGUCAACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA50    (2) INFORMATION FOR SEQ ID NO:58:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:    UGACAGUCCUGUUU14    (2) INFORMATION FOR SEQ ID NO:59:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 359 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:    UGGCCUACACGAAAGGCCAGACUACUCAGGCACUCCUGCUUUGUCCUGACAGUCCACCGG60    CUUUCGGUGGUGCAUUUGAUCACUUGGCACGACGCAUCGCAUCCCCAGACGAUGGAGCAA120    CCUCCACCUCUAACAUCGGAAGCACACCCGCGCCGCCACAUCGAUCAGUUCCGCAUGGUC180    CAUUAUAUGGUGUUGCACACAAAGAGACCAACUGAAGAGACAAACAACACAGUAACCAAG240    GGCCUAGAGCGUAAUCGCCGCUGCCCCAUAAGAGUAAGCUGUACCUUCAAACUCUCUGGC300    GCGGAGAUGUGAUACGCGCCGGCCCCGCUUAGGUUUAACAAGAUCGGGCUAUGGGACAG359    (2) INFORMATION FOR SEQ ID NO:60:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 50 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:    AAACAGAGAAGUCAACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA50    (2) INFORMATION FOR SEQ ID NO:61:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:    UGACUGUCCUGUUU14    (2) INFORMATION FOR SEQ ID NO:62:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:    UGACCGUCCUGUUU14    (2) INFORMATION FOR SEQ ID NO:63:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 50 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:    AAACAGAGAACUCAACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA50    (2) INFORMATION FOR SEQ ID NO:64:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:    UGAGAGUCCUGUUU14    (2) INFORMATION FOR SEQ ID NO:65:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 50 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:    GUUGGGAGAAGUUUACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA50    (2) INFORMATION FOR SEQ ID NO:66:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:    AAACAGUCCCCAAC14    (2) INFORMATION FOR SEQ ID NO:67:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 16 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:    UUUCAGUCAGUUGCUC16    (2) INFORMATION FOR SEQ ID NO:68:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:    UUUCAGUCAGUUGC14    (2) INFORMATION FOR SEQ ID NO:69:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:    UUUCAGUCAGUUGCUCAA18    (2) INFORMATION FOR SEQ ID NO:70:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 16 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:    AGAGCGUCGGUAUUAA16    (2) INFORMATION FOR SEQ ID NO:71:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:    UGGGUGUCGACAUA14    (2) INFORMATION FOR SEQ ID NO:72:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:    UGACAGUCCUGUUUUUUU18    (2) INFORMATION FOR SEQ ID NO:73:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 50 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:    NNNNNNAGAANNNNACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA50    (2) INFORMATION FOR SEQ ID NO:74:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:    NNNNNGUCNNNNNN14    (2) INFORMATION FOR SEQ ID NO:75:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 85 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:    NNNNNNNUGACAGUCCUGUUUCCUCCAAACAGAGAAGUCAACCAGAGAAACACACGUUGU60    GGUAUAUUACCUGGUANNNNNNNNN85    (2) INFORMATION FOR SEQ ID NO:76:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 133 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:    CTCGAGGGCCCGCGGCCGGCAGGCCTGGTACCACGCGTGACAGTCCTGTTTCCTCCAAAC60    AGAGAAGTCAACCAGAGAAACACACGTTGTGGTATATTACCTGGTAGTCGAGGGATCTTT120    GTGAAGGAACCTT133    (2) INFORMATION FOR SEQ ID NO:77:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 161 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:    GGGAAAGCTTGCATGCCTGCAGGTCGACTACCAGGTAATATACCACAACGTGTGTTTCTC60    TGGTTGACTTCTCTGTTTGGAGGAAACAGGACTGTCACGCGTGGTACCAGGCCTGCCGGC120    CGCGGGCCCTCGAGGGATCCCCGGGTACCGAGCTCGAATTC161    (2) INFORMATION FOR SEQ ID NO:78:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 16 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:    TGCCCGTCTGTTGTGT16    (2) INFORMATION FOR SEQ ID NO:79:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:    TGCCCGTCTGTTGT14    (2) INFORMATION FOR SEQ ID NO:80:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 16 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:    TGCCCGTCTGTTATGT16    (2) INFORMATION FOR SEQ ID NO:81:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 14 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:    TGCCCATCTGTTGT14    (2) INFORMATION FOR SEQ ID NO:82:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 59 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:    GGGAGUCACACAACAAGAAGGCAACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA59    (2) INFORMATION FOR SEQ ID NO:83:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 19 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:    GCGUGCCCGUCUGUUGUGU19    (2) INFORMATION FOR SEQ ID NO:84:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 189 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:    GGGAATTCGAGCTCGGTACCCGGGGATCCCTCGAGGATCCACACAACAAGAAGGCAACCA60    GAGAAACACACGTTGTGGTATATTACCTGGTACGCGTGACAGTCCTGTTTCCTCCAAACA120    GAGAAGTCAACCAGAGAAACACACGTTGTGGTATATTACCTGGTAGTCGACCTGCAGGCA180    TGCAAGCTT189    (2) INFORMATION FOR SEQ ID NO:85:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 159 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:    CCCGGGGATCCCTCGAGGATCCACACAACAAGAAGGCAACCAGAGAAACACACGTTGTGG60    TATATTACCTGGTACGCGTGACAGTCCTGTTTCCTCCAAACAGAGAAGTCAACCAGAGAA120    ACACACGTTGTGGTATATTACCTGGTAGTCGACCTCGAG159    (2) INFORMATION FOR SEQ ID NO:86:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 153 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:    GCTAGCCCGGGGATCCTCGCCATAGAAGAATACCAGAGAAACACACGTTGTGGTATATTA60    CCTGGTACGCGTGACAGTCCTGTTTCCTCCAAACAGAGAAGTCAACCAGAGAAACACACG120    TTGTGGTATATTACCTGGTAGTCGACCTCGAGG153    (2) INFORMATION FOR SEQ ID NO:87:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 150 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:    CCCGGGGATCCTTGAGCAACTAGAAGAAAACCAGAGAAACACACGTTGTGGTATATTACC60    TGGTAACGCGTGACAGTCCTGTTTCCTCCAAACAGAGAAGTCAACCAGAGAAACACACGT120    TGTGGTATATTACCTGGTAGTCGACCTCGA150    (2) INFORMATION FOR SEQ ID NO:88:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 51 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:    NNNNNNNAGAASNNNACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA51    (2) INFORMATION FOR SEQ ID NO:89:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 15 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:89:    NNNSNGUCNNNNNNN15    (2) INFORMATION FOR SEQ ID NO:90:    (i) SEQUENCE CHARACTERISTICS:    (A) LENGTH: 18 base pairs    (B) TYPE: nucleic acid    (C) STRANDEDNESS: single    (D) TOPOLOGY: linear    (ii) MOLECULE TYPE: RNA (genomic)    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:    CACGGACUUCGGUCCGUG18    __________________________________________________________________________

We claim:
 1. A host cell transformed with a vector operatively linked toexpression control sequences containing a nucleotide sequence which ontranscription gives rise to a compound comprising the structure:##STR7## wherein each of U, G, A and C represents a ribonucleotide andwherein each * represents a base pair and wherein each straight linerepresents a phosphodiester bond.
 2. The host cell of claim 1 furthercomprising a compound comprising the structure. ##STR8##
 3. The hostcell of claim 1 wherein the host cell is a prokaryotic cell, aneukaryotic cell, a plant cell, an animal cell or a human cell.
 4. Amethod of using a vector operatively linked to expression controlsequences containing a nucleotide sequence which on transcription givesrise to a compound comprising the structure: ##STR9## wherein each of U,G, A and C represents a ribonucleotide and wherein each * represents abase pair and wherein each straight line represents a phosphodiesterbond to produce a compound in a host cell which comprises: introducingthe vector into a suitable host cell and culturing the host cell underconditions to express the transfer vector so as to thereby produce thecompound in the host cell.
 5. The method of claim 4 wherein the vectoroperatively linked to expression control sequences further contains asecond nucleotide sequence which on transcription gives rise to acompound comprising the structure: ##STR10## .
 6. A method ofinactivating an RNA target sequence which comprises contacting the RNAtarget sequence with a compound comprising the structure: ##STR11##wherein each of U, G, A and C represents a ribonucleotide and whereineach * represents a base pair and wherein each straight line representsa phosphodiester bond.
 7. The method of claim 6 further comprisingcontacting the HIV RNA target sequence with a compound comprising thestructure: ##STR12## .
 8. The method of claim 6, wherein theinactivation of the target RNA occurs in vitro in cell culture.
 9. Themethod of claim 6, wherein the inactivation of the target RNA occurs invivo.
 10. A compound comprising the structure: ##STR13## and furthercomprising the structure: ##STR14## wherein each of U, G, A and Crepresents a ribonucleotide and wherein each * represents a base pairand wherein each straight line represents a phosphodiester bond.
 11. Acomposition which comprises the compound of claims 10, in associationwith a pharmaceutically acceptable carrier.
 12. An engineered DNAmolecule coding for the compound of claim
 10. 13. A vector operativelylinked to expression control sequences containing a nucleotide sequencewhich on transcription gives rise to the compound of claim
 10. 14. Acomposition which comprises the vector of claim 13 in association with apharmaceutically acceptable carrier.
 15. A composition which comprisesthe vector of claim 13 in association with a pharmaceutically acceptablecarrier.
 16. The vector of claim 13, wherein the vector is capable ofself-replication in a host.
 17. The method of claim 6, wherein the RNAtarget sequence is an HIV-1 viral RNA target sequence.
 18. The method ofclaim 17, wherein the HIV-1 viral RNA target sequence is a gag proteinRNA target sequence or a tat protein RNA target sequence.
 19. The methodof claim 6, wherein the HIV RNA target sequence is a HIV 5' leader viralRNA target sequence.
 20. The method of claim 19, wherein the HIV viralRNA target sequence comprises a 111/112 site in an HIV 5' leader viralRNA target sequence.
 21. The method of claim 6, wherein the inactivationof the target RNA occurs in a host cell.
 22. The method of claim 6,wherein the inactivation occurs in a host cell which has beentransformed with a vector comprising a DNA sequence coding for thecompound, the DNA sequence being operatively linked to expressioncontrol sequences.
 23. The method of claim 21 or 22, wherein theinactivation of the target RNA occurs in a prokaryotic host cell. 24.The method of claim 21 or 22, wherein the inactivation of the target RNAoccurs in a eukaryotic cell.
 25. The method of claim 23, wherein thehost cell is an animal cell.
 26. The method of claim 23, wherein thehost cell is a plant host cell.
 27. The method of claim 23, wherein thehost cell is a human cell.